SEQUENCE LISTING 




<110> KOSAN BIOSCIENCES, Inc. 
REEVES, CHRISTOPHER 
CHU, DANIEL 
KHOSLA, CHAITAN 
SANTI, DANIEL 
WU, KAI 

<120> POLYKETIDES ENCODING THE fkbA GENE OF THE FK-520 
GENE CLUSTER 

<130> 30062-20026.11 

<140> 09/940,316 
<141> 2001-08-27 

<150> 09/410,551 
<151> 1999-10-01 

<150> US 60/139,650 
<151> 1999-06-17 

<150> US 60/123,810 
<151> 1999-03-11 

<150> US 60/102,748 
<151> 1998-10-02 

<160> 72 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 77536 

<212> DNA 

<213> Streptomyces hygroscopicus 

<220> 
<221> CDS 

<222> (52275) . . . (71465) 
<400> 1 

gatctcaggc atgaagtcct ccag^cgagg cgccgaggtg gtgaacacct 
tgtacggacc acttcagtca gcggcgafctg cggaaccaag tcatccggaa 
ttacaagatc ctcacattgc gcgaccgcca gcatacgctg agttgcctca 
gaaagggcgc gggcggtccg caccagggcg gagtacgcga cgagagtggc 
accgtcacct ctctcccccg ccggcgggat gcccggcgtg acacggttgg 
acgctgaaca cccgcgcggt gtggcgtcgg ggacaccgcc tggcatcggc 
tacggggagg gcgtacggcg gccgtggctc gtgctcacgg ccgccgggcg 
gagacggcac tcggcgagca gggacgcctg gtcggcacct gcgggccgga 
gttcgcgggc gggcggtggc cggtggtgag ccagctctcc agggcggtga 
gtgacacggc agcaaaggcc ggagtcggtc ggggaaggtg tcgacgaggg 
cgtgccgtcc tcgatgcggt agtagcggta ccggccgcca ggccgctgcc 
gcgtacacgt cggagcccgg gcggcaggca gcagcacgtc gagagtgcct 
cagcggcttg ccgatacgac cggtcaacgc gatgcgttcc acggccgcgt 
ggagcgggtg gcgtagtcgt agtcggcatc gcagcccggg accgtccccg 
cggtgtgccg gcttccttct ccccatcgaa gccggggtcg aactcctcgc 
ctgcgtcaga tcccagtaga cctcgtggtg gtacggccac aagaactcgg 
gaacccggcg cggagcagcg cctcgcgcgc ctggccggct gcggggccgc 
ggtggggtag tcgcgcaggg cggccggcag gaaggtgaag aggttgggac 
ccacagggtg ccttcccagt cgactcctcc gtcgtacagc tcgggatggt 




POLYKETIDE SYNTHASE 



1 



cgccgctgct 


60 


taaagggcgg 


120 


gaggcaaacc 


180 


gcacccgcgc 


240 


gctctcctcg 


300 


cgggtgacgg 


360 


gtcatccgtc 


420 


cgaccgtgtg 


480 


aggctgagcg 


540 


cgtcggtgtg 


600 


ggacatacgc 


660 


ggatggtgat 


720 


ggacgccgga 


780 


gggcgcaata 


840 


ggtagacgcg 


900 


agtcggccgg 


960 


ctgccgcgta 


1020 


cctccgcgcg 


1080 


tctccagctg 


1140 
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SEQUENCE LISTING 

<110> REEVES, CHRISTOPHER 
CHU, DANIEL 
KHOSLA, CHAITAN 
SANTI, DANIEL 
WU, KAI 

<120> POLYKETIDE SYNTHASE ENZYMES AND RECOMBINANT DNA 
CONSTRUCTS THEREFOR 

<130> 30062-20026.00 

<140> 09/410,551 
<141> 1999-10-01 

<160> 71 

<170> Patentin Ver. 2.0 

Q <210> 1 

<211> 77536 
^fj <212> DNA 

<213> Streptomyces hygroscopicus 



"T' <400> 1 

gatctcaggc atgaagtcct ccaggcgagg cgccgaggtg gtgaacacct cgccgctgct 60 

2 tgtacggacc acttcagtca gcggcgattg cggaaccaag tcatccggaa taaagggcgg 12 0 

ttacaagatc ctcacattgc gcgaccgcca gcatacgctg agttgcctca gaggcaaacc 180. 

B gaaagggcgc gggcggtccg caccagggcg gagtacgcga cgagagtggc gcacccgcgc 240 

Q accgtcacct ctctcccccg ccggcgggat gcccggcgtg acacggttgg gctctcctcg 3 00 

^ acgctgaaca cccgcgcggt gtggcgtcgg ggacaccgcc tggcatcggc cgggtgacgg 3 60 

ry tacggggagg gcgtacggcg gccgtggctc gtgctcacgg ccgccgggcg gtcatccgtc 420 

gagacggcac tcggcgagca gggacgcctg gtcggcacct gcgggccgga cgaccgtgtg 4 80 

^ gttcgcgggc gggcggtggc cggtggtgag ccagctctcc agggcggtga aggctgagcg 540 

^ gtgacacggc agcaaaggcc ggagtcggtc ggggaaggtg tcgacgaggg cgtcggtgtg 600 

^ cgtgccgt cc tcgatgcggt agtagcggta ccggccgcca ggccgctgcc ggacatacgc 660 

gcgtacacgt cggagcccgg gcggcaggca gcagcacgtc gagagtgcct ggatggtgat 72 0 

cagcggcttg ccgatacgac cggtcaacgc gatgcgttcc acggccgcgt ggacgccgga 780 

ggagcgggtg gcgtagtcgt agtcggcatc gcagcccggg accgtccccg gggcgcaata 84 0 

cggtgtgccg gcttccttct ccccatcgaa gccggggtcg aactcctcgc ggtagacgcg 900 

ctgcgtcaga tcccagtaga cctcgtggtg gtacggccac aagaactcgg agtcggccgg 960 

gaacccggcg cggagcagcg cctcgcgcgc ctggccggct gcggggccgc ctgccgcgta 102 0 

ggtggggtag tcgcgcaggg cggccggcag gaaggtgaag aggttgggac cctccgcgcg 10 8 0 

ccacagggtg ccttcccagt cgactcctcc gtcgtacagc tcgggatggt tctccagctg 114 0 

ccagcgcacg aggtagccgc cgttggacat cccggtgacc agggtgcgct cgagcggccg 12 00 

gtggtagcgc tgggcgaccg acgcgcgggc ggcccgggtc agctgggtga ggcgggtgtt 12 60 

ccactcggcg acggcgtcgc ccggccggga gccatcacgg tagaacgcgg ggccggtgtt 13 2 0 

gcccttgtcg gtggcggcgt aggcgtaacc gcgggcgagc acccagtcgg cgatggcccg 13 8 0 

gtcgttggcg tactgctcgc ggttaccggg ggtgccggcc acgaccaggc caccgttcca 1440 

gcggtcgggc agccggatga cgaactgggc gtcgtggttc cacccgtggt tggtgttggt 15 0 0 

ggtggaggtg tcggggaagt agccgtcgat ctggatcccg ggcactccgg tgggagtggc 15 60 

caggttcttg ggcgtcagcc ctgcccagtc cgccgggtcg gtgtggccgg tggccgccgt 162 0 

tcccgccgtg gtcagctcgt ccaggcagtc ggcctgctga cgtgccgccg ccgggacacg 1680 

cagctgggac agacgggcgc agtgaccgtc cggggcatcg ggagcaggcc gggccgtggc 1740 

cggtgagggg agcaggacgg cgactgcggc cagggtgaga gcgccgaggc cggtgcgtct 18 00 

tctcggggcc cgtccgacac cgaggggcag aaccatggag agcctccaga cgtgcggatg 1860 

gatgacggac tggaggctag gtcgcgcacg gtggagacga acatgggtgc gcccgccatg 192 0 



i 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



actgaggccc ctcagaggtg ggccgccgcc atgacgggcg cgggaccgcg gg^S^^ccgg 
ggcigtgccc gcggccgcca ccggttccgg gtccccgggt cagggacagg tgtcgttcgc 
qLqgtgaag tagccggtcg gcgactcttt caaggtggtc gtgacgaagg tgttgtacag 
gcccKStc tglccSagl ccttggcgta ggtgtaaccg gcgctcgtcg tggcgcggcc 
cgcc?g|acg tSgc|tagt tgccggcggt ccagcagacg gccgtggcac cggtcgtctg 
cjcgg?lacc gcglccgaga gcggtccggc cttgccgtcc gcgtcccggg cggcgaccgc 
oLggtgtgc gatgtgcccg ccctcaggcc ggtgtccgtg tacgacgtcg tggcggacgt 
Ltgatctgg gcaccgtcgc ggtggacggc gtagtcggtg gcgccgtcga cgggtttcca 
. Iltcaggctg atggtggtgt cggtggcgcc ggtggcggcc aggccggacg ^^f^'^^^l^^ 

SaaccLgg tcggaggcgg atccgctcag gccgaagaac tgcgtgatcc agtagctgga 2520 

acagatcSI tcSgSaS aggcggcgcc ggtgctgccg cactgctgtg -^--^f^^l ^580 

JggLcgLc ggggtgccgt gcccgatgcc cggcacccgg ttcacctcca cggccaccga 2640 

fccgtccgcg g?caggtact cctcgtgccg ggtggagttc gggccgatca ccgaggtacg 2700 

qtccggcgtc ?gggacacgc cgtgcacagc ggtccactgg tcgcgcaact cgtcggcgtt 2760 

gcgcllclcg aciltggtit ccttgtcgcc gtgccagatg gccacgcgcg gccacgggcc 2820 

cglcclclal ggi^aiccgt cacggacccg ccgcgcccac tggtccgcgg ^caggtcggt 2880 

cccgggg^tc Sgcacaggt acgcgctgct gacgtcggtg gcacagccga agggcaggcc 2940 

ggcSSacc gc^ccggcct ggaagacgtc cggataggtg gcgagcatca ccgacgtcat 3000 

g|clcc|ccg gcggacagcc cggtgatgta ggtgcgctgg gggtccgcgc cgtaggcgga 3060 

Scggtgtga gcggccatct gccggatcga cgcggcttcg ccctggcccc tgcggttgtc 3120 

a Ictlctctig aaccagttga agcacctgtt cgcgttgttc gacgacgtgg tctcggcgaa 3180 

^fi caclagcall aagccatagc ggtccgcgaa tgagagcagg ccggagttgt cggcgtagcc 3240 

^ ctglgcgtcc tglgtgcaac cgtgcagggc gaacaccacc gccggctccg cgggcaggga 3300 

£ cglllgccgg tISc|taca tgttcagccg gcccgggttc gtgccgaagt ccgcgacctc 3360 

D gItSggtcc gclttigtca gaccgggctt ggccaggccc gccgcggcgt gg9<=^g^-93 ^^^0 

3 cIccgSccg Igcagggccg ctccgagtac gagggccacg acggccacga gacgggtgag 3480 

'r. ciccccccgc cgtcccggac gcgacaacga cccgaccggc ggcgaggagg agagggggaa 3 540 

C cagcgggg?g a^gattcccc ggaacggcgg cggctgcatg gcggctccct cgatgtcgtg 3600 

^' gSSScac gSgggctcc ctgacgtcga tcagtgggag cgccccggtg ^^cggcaccg 3660 

?Sg|tggt ?caacccgca acggtatggc ccggagcacc acaccccgca ^cgcgcgatg 3720 

W tgScccSa cggattgtgt cgccttgcgg aatctgatac ccggacgcga cgaacgcccc 3780 

» acccqacacg ggtagggcgt catggtgtcc gactcggccg gtcggccttg cctgccctgg 3840 

m acgglccSS Stclicgla ccgggcgtcg gcgggctggg cggtatggcg gccgaggacg 3900 

^-1 cclqccgcg? g|ggc|gccg cgcccaagtg cagtacgccg accgtggccg gcgggagggc 3960 

6 cgglcclgtc S?|cl|tcc cgcggccctg cgggaccgct cgtcccagac ggs^tccacc 402 

2 gclgcgScc gigltccgtg tccgcggcgg tagaccatca gtgtccgctc S^aggCgatg 408 

Icaatqacac cgtcctggtt gtagccgatg gtgcgcacgc tgatgatgcc tacgtcaggt 414C 

cgSScgg actcccggt Sttcaggacc tcggactgcg agtagatggt 9tcgccctcg 420 

alqaccggS tcggcagcct gacccggtcc cagccgaggt tggccatcac atgctgggag 426 

a?|tcg|?|a cgc?ct|ccc ggtgaccagg gcgagggtga aggtggagtc caccagcggc 432 

?tqcc5cagg tggtgcccgc cgagtagtgg cggtcgaagt gcagcggcgc ggtgttctgc 438 

gtcaggagS tUglcagla gttgtcggtc tccaggaccg tgcggcccag ggggtggcgg 444 

?acacqtcgc cggtggtgaa gtcctcgaag tagcggccct gccagccctc gaccacagcg 450 

gtgcglgtig cSc??g|tc ?gggttctca gtcgtcatgg cgctcattct ggg^agtccc 456 

cg|tcc|c?g tiaaatgccg aaccttcacc gggctcatac gtgcggcgca tgagccctgg - 462 

accgtacgtl g?cgtagaac ctcgccacca ctggcgcgcg tggtcctccg S^g^g^gtga 468 

ccacgcclac cgtlcgccgc gcctgcgggt cgtcgagcgg cacggcgacg S^g^ggtcac 474^ 
cgggcccgga cgggctgccg gtgagggggg cgacggccac accgaggccg g^gg^g^^^^ 

qqqcccgSg cgtgctcagc tcggtgctct ccaggacgac ccgcggcacg aatccggccg 486 

cScgcIcal ccgltcggfg atctggcgca gtccgaagac cggctccagt gccacgaacg 492 

c?tc!tcgg? calctcScg gtccgcaccc ggcggcgtct ggccagccgg tgtccgggtg 498 

ggacgagcag gcLagtlcc Lgtcccgca gtggtgtcca ctccacatcg tccccggcgg 504 

1^^^ If Igtcagcccc aggtccagcc tgctgttgcg gacgtcgtcg accacggcgt 510 

cggcglStc Iccgclcagt tcgaaggtgg tgccgggagc cagccggcgg tacccggcga 516 

SaSclgg Lclagccag gtgccgtagg agtgcaggaa acccagtgcc acggtgccgg 522 

Stclgggtc gatcagggcg gtgatgcgct gctcggcgcc ggagacctca ctgatcgcgc 528 

qcagqgcgtg |gcgcg|aag acctcgccgt acttgttgag ccggagccgg ttctggtgcc 534 

|gt?|SS| Sglaficcl actcgtcgct ccagccgccg gatggccctg gacagggtcg 540 



tcgcccgcgg cggtggcgat cgtggcgccg atcagcgtcg cgttcgccgt caggcaccgc 
^ atcgatccgc tgtacgccgg actgatggcg gtgaacgggg ccgcagccgg cagtttcgcc 

® ccctccggga tcctgggcgg catcgtccac tcggcgctgg agaagaacca tctgcccgtc 

rU agcggcgggc tgctcttcgc aggcaccttc gccttcaacc tggcggtcgc cgcggtgtca 

%l tggctcgtcc tcgggcgcag gcgcctcgaa ccacatgacc tggacgagga caccgatccc 

H acggaagggg acccggcttc ccgccccggc gcggaacacg tgatgacgct gaccgcgatg 

U gccgcgctgg tgctgggaac cacggtcctc tccctggaca ccggcttcct ggccctcacc 

ttggcggcgt tgctggcgct gctcttcccg cgcacctccc agcaggccac caaggagatc 
gcctggcccg tggtgctgct ggtatgcggg atcgtgacct acgtcgccct gctccaggag 
ctgggcatcg tggactccct ggggaagatg atcgcggcga tcggcacccc gctgctggcc 
gccctggtga tctgctacgt gggcggtgtc gtctcggcct tcgcctcgac caccgggatc 
ctcggtgccc tgatgccgct gtccgagccg ttcctgaagt ccggtgccat cgggacgacc 
ggcatggtga tggccctggc ggccgcggcg accgtggtgg acgcgagtcc cttctccacc 
aatggtgctc tggtggtggc caacgctccc gagcggctgc ggcccggcgt gtaccagggg 
ttgctgtggt ggggcgccgg ggtgtgcgca ctggctcccg cggccgcctg ggcggccttc 
gtggtggcgt gagcgcagcg gagcgggaat cccctggagc ccgtttcccg tgctgtgtcg 
ctgacgtagc gtcaagtcca cgtgccgggc gggcagtacg cctagcatgt cgggcatggc 
taatcagata accctgtccg acacgctgct cgcttacgta cggaaggtgt ccctgcgcga 
tgacgaggtg ctgagccggc tgcgcgcgca gacggccgag ctgccgggcg gtggcgtact 
gccggtgcag gccgaggagg gacagttcct cgagttcctg gtgcggttga ccggcgcgcg 
tcaggtgctg gagatcggga cgtacaccgg ctacagcacg ctctgcctgg cccgcggatt 
ggcgcccggg ggccgtgtgg tgacgtgcga tgtcatgccg aagtggcccg aggtgggcga 
gcggtactgg gaggaggccg gggttgccga ccggatcgac gtccggatcg gcgacgcccg 
gaccgtcctc accgggctgc tcgacgaggc gggcgcgggg ccggagtcgt tcgacatggt 
gttcatcgac gccgacaagg ccggctaccc cgcctactac gaggcggcgc tgccgctggt 
acgccgcggc gggctgatcg tcgtcgacaa cacgctgttc ttcggccggg tggccgacga 
agcggtgcag gacccggaca cggtcgcggt acgcgaactc aacgcggcac tgcgcgacga 
cgaccgggtg gacctggcga tgctgacgac ggccgacggc gtcaccctgc tgcggaaacg 
gtgaccgggg cgatgtcggc ggcggtcagc gtcagcgtcg tcggcgcggg cctcgcggag 



5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 



Qctqggagat gttgagccgt tccgcggtga tcgtcacgtg ctcgtgctcg gccaaggccg 
tgaaccactg caactcccgt atctccatgc agggactata cgtaccgggc atggtcctgg 
cgaggtttcg tcatttcaca gcggccgggc ggcggcccac agtgagtcct caccaaccag 
gaccccatgg gagggacccc atgtccgagc cgcatcctcg ccctgaacag gaacgccccg 
ccgggcccct gtccggtctg ctcgtggttt ctttggagca ggccgtcgcc gctccgttcg 
ccacccgcca cctggcggac ctgggcgccc gtgtcatcaa gatcgaacgc cccggcagcg 
gcgacctcgc ccgcggctac gaccgcacgg tgcgtggcat gtccagccac ttcgtctggc 
tgaaccgggg gaaggagagc gtccagctcg atgtgcgctc gccggagggc aaccggcacc 
tgcacgcctt ggtggaccgg gccgatgtcc tggtgcagaa tctggcaccc ggcgccgcgg 
gccgcctggc atcggccacc aggtcctcgc gcggagccac cgaggctgat cacctgcgga 
catatccggc tacggcagta ccggctgcta ccgcggaccg caaggcgtac gacctcctgg 
tccagtgcga agcggggctg gtctccatca ccggcacccc cgagaccccg tccaaggtgg 
gcctgtccat cgcggacatc tgtgcgggga tgtacgcgta ctccggcatc ctcacggccc 
tgctgaagcg ggcccgcacc ggccggggct cgcagttgga ggtctcgatg ctcgaagccc 
tcggtgaatg gatgggatac gccgagtact acacgcgcta cggcggcacc gctccggccc 
gcgccggcgc cagccacgcg acgatcgccc cctacggccc gttcaccacg cgcgacgggc 
agacgatcaa tctcgggctc cagaacgagc gggagtgggc ttccttctgc ggtgtcgtgc 
tacaacgccc cggtctctgc gacgacccgc gcttttccgg caacgccgac cgggtggcgc 
accgcaccga gctcgacgcc ctggtgagcg aggtgacggg cacgctcacc ggcgaggaac 
tggtggcgcg gctggaggag gcgtcgatcg cctacgcacg ccagcgcacc gtgcgggagt 
□ tcagcgaaca cccccaactg cgtgaccgtg gacgctgggc tccgttcgac agcccggtcg 

gtgcgctgga gggcctgatc cccccggtca ccttccacgg cgagcacccg cggcggctgg 
gccgggtccc ggagctgggc gagcataccg agtccgtcct ggcgtggctg gccgcgcccc 
^ acagcgccga ccgcgaagag gccggccatg ccgaatgaac tcaccggagt cctgatcctg 

Q gccgccgtgt tcctgctcgc cggcgtacgg gggctgaaca tgggcctgct cgcgctggtc 

Ld gccacctttc tgctcggggt ggtcgcactc gaccgaacgc cggacgaggt gctggcgggt 

U ?tccccgcga gcatgttcct ggtgctggtc gccgtcacgt tcctcttcgg gatcgcccgc 7020 

In gtcaacggca cggtggactg gctggtacgt gtcgcggtgc gggcggtggg ggcccgggtg 7080 

ggagccgtcc cctgggtgct cttcggcctg gcggcactgc ^ctgcgcgac ^^^^^cggcc 714 0 

7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 



n 

m 



ggctccagat 

gggcagtcgg 

tccgtacgcc 

cagttcagga 

ttcaggtgcc 

ccgaagcggt 

gcaggtcggc 

gttcctcgac 

gcgagcgcag 

aacccgcctg 

actccggcag 

ggtggaacgc 

gtgcgaagtt 

tgcgggccag 

cgtggtcgtt 

cctcgcggat 

aggtgttgtc 

gggagcgcca 

atctccatga 

gccgacgcga 

tgcttggcca 

gcgtactcgc 

gcgacgagtt 

accgcggcgg 

ccgtaggcga 

gcgccggccg 

ccggacggct 

accgcaccgg 

gcagtcgtcc 

gtccgcatcg 

ttcccgctgg 

acggtccacg 

ccgacgtgtg 

gccgccactt 

agttcgccgg 

tcacgctcag 

acggaagttc 

gtacacgacc 

gtccacgggc 

ggggtcgtcc 

ccggtcttcc 

ctgcgctcgt 

ccgatcaggt 

gcgtcgacgt 

atcgggtgga 

cgccgcagcg 

ccgcggatca 

aggtcctcgg 

gtgatgaccg 

tcggcgtcct 

gacgtggccg 

gtccgcagtt 

acgagtgtca 

cccgcgccga 

gcagcgagta 

ggccgagttc 

tgcccgtcga 

cgcacagggc 



gcaggcgttc 

agtccgcgaa 

ggaagtccgc 

tcgtcgcacc 

acgtcgacgg 

cgcccatggt 

gtcggagtag 

gcggctgagt 

gaagtcctcg 

gtacatcagg 

cgacaggagc 

cacctcggca 

cagctccgtg 

cacgaagtac 

cttgctcgcc 

ctcgtcggtg 

caggtcccag 

gcgcgtgctg 

gcttggcgtc 

gcacctgtgc 

ggatcgtcgc 

acacgcgggc 

ggtggtcgcc 

tgcggcaggc 

gtgacgccgc 

gcacacgcac 

tcgggacgcg 

aaccatcctc 

agaccttgtg 

ccgacagatc 

tcagctcctt 

cggccatgcc 

cggtgaactc 

ccgcgcagag 

acgtgtccca 

gcatcgacgg 

gcgagctgga 

agttccatcg 

cagtccgacc 

ttgacgggtg 

ggccgtggtg 

cgccggtgcg 

ccgcggtgcg 

cctcgacgga 

gcagccggct 

ccgcgagcag 

cctcgaccgt 

gccgggccac 

ggataccggg 

cgacgacggc 

tccgcagcac 

cggtggcgat 

ccgggacgcc 

gcacgatcag 

cgggtcgagg 

gtcggcgaag 

gtcgaggacg 

cgccagcgac 



gacgccggcg 

gcccgcgaac 

caccaggtgc 

ggcaccgaac 

cttcttctcc 

gacgacgagg 

tgcacgccgg 

tcctcctccc 

tcgggaccgg 

cggcgccgac 

9tggccgcct 

cgctcggcgg 

gcgatctcgc 

tccgccacac 

accgcctgga 

aggaccacct 

accagacact 

ggccagcatc 

gcggtacgcc 

ggcggtcgcg 

gggcaccatc 

cgcgatctgc 

gagcggccgg 

ccgcaggatc 

gaccagcatc 

ctggtccagg 

ctcgacgcgt 

ctggagaccg 

gccgtcgacg 

gctgcccgcc 

caggaaggtc 

ctgcgacgtc 

gccgttctcc 

caggccgtcg 

ctcggcggcc 

cccgcagccg 

ggtccgggcc 

cgaacagcga 

tggtcttcgt 

cggtcatgag 

tccctcgcgg 

tttgtgcagc 

cagcggcccg 

cgcggtgccc 

cgtgacgaag 

gtccccggcg 

cgggatcagg 

ggagtcggcc 

cgccgctgcc 

ctcgatcacc 

accggggtcg 

ccgcgcccgc 

gtggcgcagc 

ctggtggtcc 

acgtcttccg 

ccgagcagca 

ctcaggctgt 

gggccgagct 



gcggaagcgc 

cggtaggcga 

gcccccgcgc 

gacacgaccc 

agcaggatga 

acctcatggg 

tcgcgttcat 

ccgcgggtgc 

agtacgcctc 

gcgagtcgac 

gctcggccgg 

gctggtcgtc 

ggacggactg 

cgaggcgttc 

ggatgccgcg 

cgtcgtcctc 

tgacaatggt 

acccggcaca 

cgttcgacga 

gccccggcgg 

tcgggcgagc 

tccgcggtcc 

ccgaactgct 

ccgacgcagc 

ggcagtgacg 

tgcagatcgg 

acgccggggg 

aagacgacca 

acagcggtgt 

tgccgctcac 

gcccgctgac 

atgacactgc 

cggctgccga 

gcgccgagcc 

cggtcaccga 

gtggacgagt 

ggcgatcgtg 

cgtgaggccg 

cttgaggaac 

aacaccttct 

accttgccca 

acccacagcg 

gtcggatggc 

tcctgcacga 

ccgggcgcgt 

gcggccatgg 

tacgacgggt 

agttcgtcaa 

gagaccgtgg 

gcggtggccg 

gcctcggcgg 

gccgccgtaa 

gcgagcgtgg 

acgctgtttc 

gggtcgaccc 

cgtcgaacgc 

cccggtggtc 

cgcggtccgg 



ccgccacctc 

tctccatcat 

gggcgccctg 

ggcaggacgt 

tgccgacggc 

cgggatcggt 

ctggctggtc 

gatcgtcatg 

ccgggcctgg 

cgtggacacc 

gtagcaccgc 

gatgaacgcg 

cgacttcggc 

cagacgctcc 

gtcgtcgagc 

cagcacggtg 

catggctgtc 

tctcgctgct 

cgtgtccctc 

cggctcgttc 

cctcgtccca 

acaggtcggc 

cccgggtccg 

cccaggcgac 

cgccggagcc 

cgtggccggc 

tgtcggcggg 

ggtggtccgc 

ccccgtcgag 

tgaagccgac 

cggcgtcgcc 

gcagcgaact 

gtcccagacc 

ggacgagcag 

caaggtcggt 

gcgaccatgg 

acgtcgaacg 

ccctccgcga 

gcgaccaacg 

cgtattcgta 

gcagcaggtc 

cgtcgacgag 

cgaggcaccc 

tccgcgccgc 

cccggacgac 

ccttctcacc 

tcatgaagtg 

ccgggatcga 

cgagtacctc 

taccgatcgc 

gcccggccac 

ggatctcctc 

tgatgccggt 

ctccctccgg 

gatcgcgtcc 

gatgtggtcg 

cgccgcggtg 

cagttgctgg 



ggacacgcag 

gcggttgcgg 

gtccgtgagc 

ggcgagcagt 

gccgtgcggg 

gagcacgcgc 

cgcagcgtca 

gagaggtcga 

tcgcgcgcga 

ggcgggctga 

acctcgggca 

atcgtggtcg 

ccccatccga 

cacgcgaggt 

gtggtgatca 

ccccgccaca 

ctctcaagcc 

gccctcgatg 

tctcgcgcct 

99cggcgacg 

gtggtcgctg 

gatgtgcccg 

ggcgtgggcc 

cgacttgcgc 

ggccaggacc 

ggcgcggcag 

cacgaccacc 

gtaggcggcg 

ccgaacccgc 

ggccgcgagt 

gagccgctgc 

gcagaggctg 

gccgtgctcg 

gtcgcgcggc 

cagcagcgcg 

actcgacggt 

tcttctccag 

acaggtcgcg 

cgtgcgcgac 

gaagccccgg 

acaggggcgg 

gttgtcgatg 

cgtcatgagc 

gtcgttgatc 

gatcggcttg 

ggtccggggt 

cgtgccgagc 

cgacgtgttc 

cgccttgacc 

gggcagcgcg 

gagttgtgcc 

ggacgtgtcg 

gcccatcact 

ggtcaccatg 

ttgcggccga 

gcgaacgcgc 

tccggtgccg 

tactcgccct 



8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
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w 



m 
m 



cggcgcgggc 

gcagttcggt 

gcggctcggc 

catccgcggc 

cgatgcggtc 

cgatgggcag 

tcagcgtgag 

cgccggtccg 

cgtcgtcgag 

gcggactgta 

actcgatgac 

cgaactcgcc 

tcatcacgtc 

tggtctgcat 

cggcttccgt 

gtccgcggac 

cagggcgtcc 

aacgagtgct 

cagcagttca 

ggcggacagt 

cgagtccaca 

gcccaggacg 

ctcgttgcgg 

cggcggcagg 

gtacatgcgc 

ccggtcggcg 

gatcgacagc 

gttcgccgcc 

gacgacgaat 

ggccttgggt 

gtcgagggtt 

ggtggcgagt 

ggggggtggg 

gccggcgagg 

C999accgtg 

gcggacctgc 

caggccgagc 

gaacggtcgc 

cggcgcgagc 

gtcgaccggc 

gtccaggtcc 

gtgccgcgcg 

cttctcgccg 

ggtcatcacg 

gtggtcggcg 

cggtgccaga 

gagcacgccc 

cgccgcacgc 

gtcggccgcg 

gctgtccggc 

gccgcgcagc 

gagcgtgacg 

ggcgcgcagc 

atccccgccg 

caccgggtcg 

atccgtgggt 

ggacagcggg 

gacgcgcaga 



ctgccccgga 

cttgcccggc 

gggcagcacc 

ggcggcggcc 

cgcgaacgac 

gaccctgctg 

cgtggcgctg 

catcgcggtg 

gcgcgacatc 

cgaaaccgtc 

gccgggaatg 

gcggccgagc 

gcggccgatc 

gtgtcacctc 

tctcatcgca 

agcacgccgg 

agccgggttc 

tccagccggt 

ccgatgcggt 

cgcagaccgg 

ccgagttccc 

gccgctgcct 

gcgctccggc 

tcgcccgcca 

atgccctgtt 

tcggtcaggt 

cctggcagcc 

gcgtagttgc 

gcggcgaggt 

ttgaggacgg 

ccggcggtgt 

tggtgggggt 

gtgcgggaga 

gtgccggagc 

aggacgatct 

cgcatgtcgt 

agctccgcga 

tggacggcgt 

aggccgacgg 

gggaacgcgt 

accagatggc 

atctgccggg 

gggcgcagcc 

gacgccgcct 

atgaccgtgg 

ccggagacgt 

tgaccggggt 

acaccgatcc 

gtgaggccgt 

acggtgagcg 

cgcagacgcg 

ccggactcgg 

agtccggccg 

gagccggtca 

tcgccatcag 

gcggcgacct 

cgggtgcgga 

ctcagctcgt 



tggtcgacgc 

tcgtcggcgc 

ggccctttgc 

tccgccggat 

gccgcgtggc 

agcgcgtgcg 

tcggaccggg 

atcacgcctg 

gtgccgacga 

ttcatggtca 

tcgccgccgc 

gcggcgaacc 

acggagagaa 

cctttcgtgg 

gctccctgtc 

ccggcgtggt 

cgatcgcgtc 

cgagctgcgc 

cggcgagtgc 

tcgcctcgtt 

ggaacgccgc 

tctgccggac 

gggccgacgg 

cggcgacgac 

cggcggtgag 

ccgcggtcag 

cttgtgcacg 

cctgaccggg 

cggtgtcgcg 

tgtcgatgcg 

ggaagacggc 

cgccgacgtc 

ggaggtaggt 

cgccggtgat 

tgccggtgtg 

gcaccgtcac 

tgatctcctt 

gccggatgtc 

acgcgtcgag 

cggcgaacgc 

gcttcgcggc 

cggcggaacc 

cggcgaggtc 

gcgggaacgt 

ggccgaagcc 

cggcgccggt 

aggtgccgag 

ggacctcggc 

cgagggtgcc 

gctccggcac 

gctcgccgag 

tctcgacgtg 

ccgcgccggt 

gggcggtcag 

cggcaggcaa 

cgatccaggt 

ccgtccggat 

cgccgtcacg 



agatgaacgc 

cgatggcgtt 

ccgagggcac 

cggtcacctt 

cggggtcggt 

cctgggtcac 

ccagcagccg 

cgtcggcgag 

tcgtcggcag 

cgccgacacc 

ggacgaatcc 

cgtcgtgcag 

tccgcttgat 

ccggagctgt 

gatgaggtcg 

cgggcgggtc 

cgcctggcgg 

gagcaccacg 

gcgcggcgac 

gaggccgttg 

gtcctccggg 

gagggcgagc 

cttgggccgg 

actgcccgtt 

cgcgctcgcc 

gccactcgcc 

ccggtgttcg 

ggtgcccagc 

ggtgagccgg 

gtcgggggtg 

ggtgaggggt 

gcaggggagg 

gtgggggtgg 

gacgacggcc 

ctcgccgcgg 

cggcagcggg 

gagccggtcg 

cgtcttcccc 

gagttcaccg 

ggtgctgcgg 

gctggtggtc 

gacaccgccg 

gaccaggccg 

ccagccgtcc 

ggtgccgacg 

ctccaggacg 

cgcgatcagc 

cggggcgagg 

cgtccgcgcc 

ccgggtgagg 

tgcgacggcg 

gacgaaccgg 

ggcgaggccc 

cagccgggtg 

cgtgatgacg 

gagacgcatc 

ctcggcgacg 

agtgatcacg 



gtcgtcgagc 
cacatgcagg 
cgaggtgacg 
gaccggcagt 
gtcgctgacc 
cgcctgtgcg 
gctcgcgacg 
ggcggtcaga 
ccggaagcgc 

ggggacccgg 

ggtacgcggc 

ctcgctgatc 

gtcacgttgg 

cttggtggtg 

aaaatctcgt 

tcccgccgcc 

gcgcccgggt 

gtcaccgggt 

gggtagtcga 

cgcagctgca 

atgtcctccg 

aggtcggtgg 

ccacgcagca 

ccggtgtgga 

ccacccttgc 

tggtcccaca 

gcgagcgcgt 

acaccggccg 

tgcaggtgcc 

aggttgtcga 

tgagggatgt 

tgggtgccgg 

ttcaggtggc 

ccctcggggt 

ctcatggtcg 

tgcagcacac 

ggccccgcgt 

atctcgatga 

gtgagcgagt 

gaatcggcca 

gcgtacacct 

gtggccgcgt 

taccacgcgg 

ggcatccggc 

aggccgaaga 

atgcccgcgg 

acatcgcgga 

gggcgccggg 

ggccggatca 

cgggccgcct 

atgcgctgct 

ccgggctgct 

gcggtggtgt 

gtgagcgcac 

tccacgtcgg 

aggccggtgc 

agttggccgg 

gctcggagca 



agggtcttcg 

tgcggcagcc 

gtggacagga 

ccgaggaacg 

aggatccgct 

cccgcgccga 

gcggcgaccg 

ctgccgctgt 

ggatagttgt 

tacggcatga 

ggcgcctcgg 

agccggtcca 

cgcaggaccc 

ccgctcgggg 

ccgcggtcgc 

agcggttgag 

cgacaccggc 

cgtccgggga 

agacgagcgt 

ccgcgatgag 

ggtcggcgtg 

ggcgttcctg 

gcgggaggtc 

cggcggcgtc 

gcatacggcg 

gcccccacgc 

cgaggaacgc 

ccgacgagta 

aggcggcgtc 

gcagggcgtc 

gggcgagggt 

gggtggtgtc 

gggcgaggat 

ccagcggccg 

ccagcgcctc 

cgcgcgcgaa 

ccatcaggtc 

accggccacc 

tgagcacgac 

gatgcgctcc 

ccgcgcccag 

ggatcaggac 

tcgcgaacgc 

cgagcatccg 

cgcggtcgcc 

cctcgccgcc 

agttgaggcc 

gctccgccga 

gccacgtgtc 

cgaaccggcc 

gctcgggggc 

cggcctgggc 

gcacgagcag 

gcgtctcggc 

tcgcggggac 

cgacgggtgg 

cggagtcggc 

tggccgagcc 



12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 



cgtggcgacg aaccgggccc ccttccaggc gaacggcaga cccgcagcgc tgtcgtccgg 15900 

cgtggtgagg gcgacggcgt gcagggccgc gtcgagcagc gccggatgca caccgaaacc 15960 

g?cSc?tcg icigcctgct cgtcgggcag cgccacctcg gcatacacgg ^Stcaccatc 16020 

Icgccaggca gcccgcaacc cctggaacgc cgacccgtac tcataaccgg catcccgcag 16080 

ttcgtcatag aaccccgaga cgtcgacggc cacggccgtg accggcggcc actgcgagaa 16140 

cggctccaca ccgacaacac cgggggtgtc gggggtgtcg ggggtcaggg tgccgctggc 16200 

gtgccgggtc cagctgcccg tgccctcggt acgcgcgtgg acggtcaccg gccgccgtcc 16260 

gglctcatca gccccttcca cggtcaccga cacatccacc gctgcggtca ccggcaccac 16320 

aaggggggat tcgatgacca gctcgtccac tatcccgcaa ccggtctcgt caccggcccg 16380 

qatgaccagc tccacaaacg ccgtacccgg cagcaggacc gtgccccgca ccgcgtgatc 16440 

agccagccag gggtgagtgc gcaatgagat ccggccagtg agaacaacac caccatcgtc 16500 

gLgggcagc gctgtgacag cggccagcat cggatgcgcc gcacccgtca accccgccgc 16560 

SialSgatcg gtggcaccgg ccgcctccag ccagtaccgc ctgtgctcga ^cgcgtacgt 16620 

gggcagatcc.agcagccgtc ccggcaccgg ttcgaccacc gtgtcccagt ccactgccgt 16680 

gcccagggtc cacgcctgcg ccaacgccgt cagccaccgc tcccagccgc cgtcaccggt 16740 

ccgcaacgac gccaccgtgt gagcctgctc catcgccggc agcagcaccg gatgggcact 16800 

gcactccacg aacaccgacc catccagctc cgccaccgcc gcgtccaacg ^^^^^SSacg 16860 

acgcagattc cggtaccagt acccctcatc caccggctcc gtcacccagg cgctgtccac 16920 

gg^cgaccac cacgccaccg acgcggcctt ccctgccacc ccctccagta ccttggccag 16980 

?tcatcctcg atggcttcca cgtggggcgt gtgggaggcg tagtcgaccg cgatacgacg 17040 

Q cacccgcacg ccttcggcct cataccgcgc caccacctcc tccaccgccg acgggtcccc 17100 

5 cgccaccacc gtcgaagccg ggccgttacg cgccgcgatc cacacaccct cgaccagacc 17160 

S gacctcaccg gccggcaacg ccaccgaagc catcgctccc cgcccggcca gtcgcgccgc 17220 

gatgacctga ctgcgcaatg ccaccacgcg ggcggcgtcc tcgaggctga gggctccggc 17280 

E cacgcacgcc gccgcgatct cgccctggga gtgtccgatc accgcgtccg gcacgacccc 17340 

^ atgcgcctgc cacagcgcgg ccaggctcac cgcgaccgcc cagctggccg gctggaccac 17400 

ctccacccgc tccgccacat ccggccgcgc caacatctcc cgcacatccc agcccgtgtg 174 6 0 

cggcagcaac gcctgagcgc actcctccat acgcgcggcg aacaccgcgg agtgggccat 17520 

gagttccacg cccatgccga cccactgggc gccctggccg gggaagacga acaccgtacg 1758 0 

cggctggtcc accgccacac ccgtcacccg ggcatcgccc agcagcaccg cacggtgacc 17640 

gaagacagca cgctcccgca ccaacccctg cgcgaccgcg gccacatcca caccaccccc 17700 

m gcgcagatac ccctccagcc gctccacctg cccccgcaga ctcacctcac cacgagccga 1776 0 

m caccggcaac ggcaccaacc cgtcaacaac cgactcccca cgcgacggcc caggaacacc 17820 

^Ci ctcaaggatc acgtgcgcgt tcgtaccgct caccccgaac gacgacacac ccgcatgcgg 17880 

n tgcccgatcc gactcgggcc acggcctcgc ctcggtgagc agctccaccg caccggccga 17940 

ccaqtccaca tgcgacgacg gctcgtccac atgcagcgtc ttcggcgcga tcccgtaccg 18000 



m 
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catcgccatg accatcttga tcacaccggc gacacccgcc gccgcctgcg catgaccgat 
qttcgacttc aacgaaccca gcagcagcgg aacctcacgc tcctgcccgt acgtcgccag 
aatggcctgc gcctcgatgg gatcgcccag cgtcgtcccc gtcccgtgcg cctccaccac 
gtccacatcg gcggcgcgca gtccggcgtt caccaacgcc tgctggatga cacgctgctg 
qqacgggccg ttgggggcgg acagcccgtt ggaggcaccg tcctggttca ccgccgaccc 
qcggacgacc gcgagaacgg tgtgtccgtt gcgctcggcg tcggagagcc gctccagcac 
lagaacgccg gcgccctccg cccagccggt gccgttggcg gcgtccgcga acgcgcggca 
qcqgccgtcg ggggagagtc cgccctgctg ctggaattcc acgaacccgg tcggggtcgc 
catgacggtg acaccgccga ccagcgccag cgagcactcc ccgtggcgca gtgcgtgccc 
qgcctggtgc agcgcgacca gcgacgacga gcacgccgtg tccaccgtga acgccggtcc 
ctggagccca tagaagtacg agatccggcc ggtgagcacg ctgggctgca tgccgatcga 
qccgaacccg tccaggtccg cgccgacgcc gtacccgtac gagaaggcgc ccatgaacac 
gccggtgtcg ctgccgcgca gtgtgcccgg cacgatgccc gcgctctcga acgcctccca 
tgtcgtttcc agcaggatcc gctgctgggg gtccatggcc cgtgcctcac gggggctgat 
qccgaagaac gcggcatcga agccggcggc gtcggagagg aagccgccgc ggtccgtgtc 
cgatccgccg gtgaggccgg acgggtccca gccacggtcg gccgggaagc cggtgaccgc 
gtcgccgcca ctgtccacca tgcgccacag gtcgtcgggc gaggtgacgc cgcccggcag 
tcggcaggcc atgcccacga tggccagcgg ttcgtcacgg gtcgcggcgg ctgtgggaac 
agcgaccggt gcggcaccac cgaccagagc ctcgtccaac cgcgacgcga tggcccgcgg 
cgtcgggtag tcgaagacaa gcgtggcggg cagtcggaca ccggtcgccg cggcgagtcg 
gttccgcagt tcgacggcgg tcagcgagtc gatacccagt tccttgaagg ccgcgtccgc 
ggacacgtcc gcggcgtccg cgtggccgag caccgccgcc gcgttgtcgc ggaccagtgc 



18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
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m 

Q 

m 
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cagcagcgcg 

ggcggtggcc 

gtgcgcggtg 

ttccagcagg 

ggtgcggttg 

ggccagcgac 

cccgttcgcc 

gtagaggacg 

gtcggctttg 

gtcgtcgagc 

cgcggcggcg 

cgccggcggt 

atgccgggcg 

cgggtcgagc 

gtaccggccg 

ctcgatgggg 

ggcggaccgg 

gagggtggtc 

ctcggtgagc 

gatgtggacc 

gtacaaggag 

cgcggcgacg 

cgggccctga 

gctccacgag 

gacgtgcaag 

ctgttccccg 

cagtccctgg 

gctcacgtcg 

gcttccggcc 

acgcgcgtgg 

cacatccacc 

caccccgcaa 

cagcagaacc 

ccggccagtg 

catcggatgc 

cagccagtac 

cggttcgacc 

cgtcagccac 

ttccatcgcc 

ctccgccacc 

atccaccggc 

cccgccggaa 

cgtgtgggag 

cgtcaccact 

acgcgccgcg 

agccatcgcc 

gcgggcggcg 

ggagtgtccg 

caccgcgacc 
cgccaacatc 
catacgagcc 
agcaccctgc 
ccgggcatcg 
ctgcgcgacc 
ctgcccccgc 
agccgactcc 
gctcaccccg 
cgcctcggtg 



gtgtcccgct 

gccgccgggc 

aggtccatcg 

cgcatgccca 

gtgccgctca 

agcgcgggca 

gccgagtagt 

aacgagcgca 

gggcgcagtg 

acggctgccg 

agctggtccc 

tcgctgcgcg 

aggagacctg 

agcggttcgg 

tcggtgacgc 

gtgtcggtgc 

acgaggccgg 

tccgcagggc 

cggtacgtct 

gcgtccgcag 

ttccgtacga 

gtcaccaccg 

gtgatcgtga 

aacggcagcc 

gccgcgtcga 

gcgatctcca 

aacgctgggc 

acgcgtcgcg 

cggccgaggg 

acggtcactc 

gcgccggtca 

ccggtctcgt 

gtgccccgca 

agaacaacac 

gccgccccgg 

cgcctgtgct 

accgtgtccc 

cgctcccagc 

ggcagcagca 

gccgcgtcca 

tcggtcaccc 

atcccctcca 

gcgtagtcga 

tcttccaccg 

atccacacgc 

ccccgcccgg 

tcctcaaggc 

accaccgcgt 

gcccagctgg 

tcccgcacat 

gcgaacaccg 

ccgggaaaga 

cccaacaaca 

gcggccacat 

agactcacct 

ccacgcgacg 

aaagcggaga 

agcagttcca 



cagcgccgga 

gcgatacggc 

tggccgccac 

caccggccga 

tgctgccggt 

gtccttcggc 

tgccctggcc 

ggtccgcgtc 

tggtggcgag 

tgtggaagac 

ggtcggcgac 

acagcaacag 

ccagcacacc 

gcgtttccgc 

ggacgtacgg 

cggtctccac 

cgaccgctcc 

cgtcctcggc 

cgtcgaggac 

gaccgggccc 

cggcggcgtc 

gttggccgac 

cgcgcagcgt 

gcacctccgc 

acagcgccgg 

cctcggcgta 

cgtagctgta 

cgcccggcgg 

tgccgctggc 

gccgccgtcc 

ccggcaccac 

caccggcccg 

ccgcgtgatc 

caccaccgtc 

tcagcccggc 

cgaacgcgta 

agtccactgc 

cgccgtcacc 

ccggatgggc 

gcgcgacggg 

aggcgctgtc 

gtacctcggc 

ccgcgatacg 

cggacgggtc 

cctcgaccag 

ccagccgccc 

tgagggctcc 

ccggcacgac 

ccggctggac 

cccagcccgt 

cagaacacgc 

cgaacaccgt 

ccgcacggtg 

ccacaccacc 

cactccgagc 

gcccgggaac 

caccggcccg 

ccgcgccctc 



catggtgccg 

gcggcgcaga 

ggcgaacgcg 

catggggcgg 

gagtccgctg 

atggcgcagc 

gcggccgccc 

ccgggtcagc 

ccgctccggg 

cgccgtgagc 

gtcacagcgg 

gaggtggcgg 

cgagccgccg 

ggcggccgtg 

ctcggccagt 

cagcacgaac 

tccgaccggt 

gatcacccgg 

atccgcgccc 

gggagtgggc 

gccgtcgacg 

cgggtccgtc 

99tggccccg 

ttcctgttcc 

gtggacgcca 

cagggtttcg 

gccggtctcg 

cggccacgcg 

gtgccgggtc 

ggcctcatcg 

gagcggggtc 

gatgaccagc 

agccagccag 

gtcggcgggc 

cgcggacaga 

ggtgggcaga 

cgtgcccagg 

ggtccgcaac 

gctgcactcc 

gcgacgcagg 

caccgtggac 

caactcgtcc 

gcgcactcgc 

ccccgccacc 

gtccacctca 

ggcgatcacc 

ggccacacac 

cccatgcgcc 

cacctccacc 

gtgcggcaac 

catcaactcc 

acgcggctga 

accgaagaca 

cccgcgcaga 

cgacaccggc 

accctcaagg 

gcgcggacgt 

ggtccagtcc 



agccggtcgg 

tcggcgaaaa 

gtgccggttc 

aaaccgccgc 

tcatcggccc 

gtcgcgagtc 

atgatgcccg 

tcgtgcaggt 

gtgagtgccg 

ggcctgccgg 

atgtggacac 

gcgccatgct 

gtgatgacca 

cgggtgaacc 

gtcgtggcgg 

cggcccgggt 

cccgcgtcga 

tgcagctcgc 

ggttccggga 

agctcggtcc 

ttcaccggtc 

gcatgcacgg 

gtcgtgtgga 

gcgagcagcg 

tagtgcggcg 

ccgtcgcgcc 

gccagccgct 

ggcggcggga 

cagctgtccg 

gccccttcga 

tcgatgacca 

tccacaaacg 

ggatgcgtac 

^gtgctgtga 

tcggtggcac 

tcgagcagcc 

gtccacgcct 

gacgccaccg 

acgaacacgg 

ttccggtacc 

caccaggcca 

tcgatggctt 

acgccttcgg 

acagtcgaag 

ccggccggca 

tggctgcgca 

gccgccgcga 

tgccacagcg 

cgctccgcca 

aacgcccgcg 

acacccatgc 

tccaccgcca 

gcacgctcac 

tacccctcca 

aacggcacca 

atcacgtgcg 

cccgcgtcgg 

acatgcgacg 



cgagcggaac 
gcggcgatgt 
cggccgcggc 
ggcggacacg 
3.gaggcccca 
cgtcgaggaa 
cgacggacga 
gccaggcgcc 
tggtcacgcc 
cggcggcgag 
cgggagtgtc 
cggcgacgag 
ccgtgccgtc 
gcggcgcttc 
cggccagcgc 
gctcggcctg 
tccggacgac 
cgagcacgaa 
gcgcggagac 

aggsgaggcc 

gcgcggtcag 
cagcgccgtc 
accgcacgcc 
gcaggcaggt 
tgtcgtccgc 

aggcggtgcg 
cgtagaacgc 
ccgccgcgac 
tgccctcggt 
cggtcaccga 
gttcatccac 
ccgtacccgg 
gcaacgagat 
cggcggccag 
cggccgcctc 
gtcccggcac 
gcgccaacgc 
tgtgagcctg 
acccgtccag 
agtagccctc 
ccgacccggt 
ccacgtgggg 
cctcgtaccg 
acgggccgtt 
acgccaccga 
aggccaccac 
tctcgccctg 
cggccaggct 
catccggccg 
cacactcctc 
ccacccactg 
cacccatcac 
gcaccaaccc 
gccgctccac 
acccatcgac 
cgttcgtacc 
gccacgcccg 
acggctcgtc 



19380 

19440 

19500 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

21300 

21360 

21420 

21480 

21540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 

22200 

22260 

22320 

22380 

22440 

22500 

22560 

22620 

22680 

22740 

22800 



cacatqcagc gtcttcggcg cgatgccata ccgcatcgcc atgaccatct tgatgacacc 
ggcgafaSc |cagcc|cct gcgcatgacc gatgttcgac "caacgaac -agcagcag 
cqqaacctca cgctcctgcc cgtacgtcgc cagaatcgcg tgcgcctcga tgggatcgcc 
clgcgtcgtc cccgtcccgt gcgcctccac cacgtccacg tcggcggggg cgagccccgc 
c??q?ggagg gcctggcgga tgacgcgctg ctgggagggg ccgttgggtg cggagatgcc 
gttlgSgS ccgtcct|it tgacggcgga ggagcggacg accgcgagga cggtgtgtcc 
qttgcgctcl gcgtcggaga gcttttcgac gacgaggacg ccggccccct cggcgaaacc 
Igtlccgtcc iccgcILIg cgaacgcctt gcaccgtccg tccggcgcga cgccgccctg 
ccglgalaac tccacgaagg tctgtggtga tgccatcact gtgacaccac ^gaccagcgc 
cagcgagcac tccccggtcc gcagcgcctg cccggcctgg tgcagcgcga ccagcgacga 
cgaacacgcc gtgtcgaccg tgaccgccgg accctccatg ccgaagaagt acgacagccg 
??cggcglgc Jccgclggct gtgtgctgta ggcgccgaat ccgcccaggt ^cgcgcccgt 
gccltagccg tagtagaagc cgccgacgaa gacgccggtg tcgctgccgc gcagggtgtc 
cggcacgatg ccggcgtgtt cgagcgcctc ccaggcgatt tcgaggagga tccgctgctg 
cgggtcgagt gcggtggcct cgcgcggact gatgccgaag aacgcggcat cgaagtcggc 
glclcccgcg Sgtlclccgg cccgcccggt ggcggactcg gcggcggcgt gcagcgcggc 
cIcgtccLg ccgcggtcgg tggggaagtc gccgatcgcg tcgcggccgt ccgcgacgag 
ctgccacagc tc^tccgg?! aggtgacgcc gcccggcagt cggcaggcca ^gccgacgac 
ggcgagcggc tcgttcgccg cggcgcgcag cgcggtgttc tcccggcgga SCtgcgcgtt 
atcctLalc gacgtccgca gcgccccgat caggtcgttc tcggccatcg cctcatccct 
fcagcacgtg cgc|atglgc icgtctgcgt ccatgtcgtc gaacagttcg tcgtccggct 
ccgcggtcg? ggt|ctcgcg ggtgcctgtg ccggtggttc accgccgtcc ggggtcccgt 
tg?c|?cc|g iitcccgttg acgtccgggg ccaggagggt cagcagatga cgggtgagcg 
cgccggcggc gggatagtcg aagacgagcg tggccggcag cggaatgccg agggcctcgg 
Hllt%Ttt Iclcagiccg agcgcggtga gcgagtcgac cccgaggtcc ttgaacgccg 
t|gtggc?gt gaccgccgcc gcgtcggtgt ggcccagcag ggtggcggcg gtgtcgcgga 
cSclccgag cagcacctgt tcccgttcct tgtggggcag gtccggcagg cgttccagca 
gSalccgcc gtcggtcgcg gagcgccggg tggggcgctg gatcggtcgc cacagcggtg 
Icggltclcc ^ggc^cgggt ggggcggtcg ccacgaccac ggcttccccg gtggcgcacg 
cggcgtcgag gaggtcggtc agccggtccg ccgcggcggt gaacgccacg gccggcaggc 
c??g?gccc| Ic|lag|?cg gccagggcct ggagcggtcc ggccgcctcg ccggacggaa 24660 
cggcglgaac laLgSgtc aggtcgaggt cgcgggtcag gcggtgcagt tcccaggccg 24720 
actcggcggt iccgtccgcg tggacgaccg cggteaccgg ggtttccggc actgtgcccg 
gctclLccg iatcacttcg gcgccgtgtc cgccgaggtg tccggcgagt tcctccgaac 
cqcccgcgag gaggacggtg tcgccgtacg aggccgcggc cgtggtgggc gcggcgggga 
cSggcgLI cgStcSgg cgcccgtcgg ccaggcgcag gtgcggttcg tcgaggcggg 
alaligSic ggcgcggcgg ggggtgaccg tgtcggtggt ctccacgagc ^cgagccggc 
ccggttccgc Jgtgtcgagc agtgcggcga cggcaccggc gacgggcccg gcctcggcgg 
acJccaccIg Igtggcgccg gcggtcctcg ggtcgtccag tgcggtacgg ^^^^cgtcgg 
qaccggatac cgggacgacg atgacgtcgg gcgtggcgtc gtcgccgagg tcggtgtacc 
Igcgllccgt ggtlccgggt gccgccgggg cccggacgcc ggtccaggtg cgccggaaca 
IcclScgL Iccgtccggg cccgtcgtgg cggggggccg ggtgatgagc 9agccgatct 
iagccaccgg ccgtcccagt tcgtcggcga ggtgcacgcg ggcgccgccc tcgccctcgc 
LLgacgla ggtgacgcgc agtttcgtgg cgccgctggt gtggacacgg acgccggtga 
acgSaa?gg claccgtacc cccgcgttct cggcggccgc gccgatgctg cccgcttgca 
qcgcggtgac gagcagcgcc gggtgcagtg tgtagcgggc ggcgtccctg gcgagggcgc 
cgtc|aglgc ia?ttcggcg cagacggtgt ctccgtggct ccacgcggcg gacatgccgc 
qqaactcSg gccgaactcg tatcccgcgt cgtcgagtcg ctggtagaag gccgcgacgt 
cSccgg??c cgcltgctcg ggcggccagg gccccggcgt ggtggccggt ^cggtggtgg 
cgatgccggc gaagccggag gcgtggcggg tccatgtccg gtcgccgtcc gtccgggcgt 
qgaclcgcac |gcacggcgt ccggtgtcgt cgggcgcggc gacggtcacg cgcacctgga 
Sgcgccggt ggcgggcagg accagcggtg tctcgacgac cagttcgtcg agcaggtcgc 
aqcctqcctc g^cggcgccg cgtccggcca attccaggaa ggcgggtccg ggcagcagta 25980 
cigcgccgtc gacgaltga c^ggcLgcc atgggtgggt ggccagcgag aaccggccgg 26040 
tllgcagcac ctcgcggag tcggggagcg ccaccgacgc ggcgagcagc gggtggtcga 26100 
cggcgtcgag tccgaggccg gaagcgtccg tgccggccgc ggtctcgatc cagtagcgct 
catggtggaa ggcgtatgtg ggcaggtcgt gtgccgtcgc cgtcgcgggg acgaccgccg 
cccagtcgac gggcacgccg gttgtgtgcg cctcggccag cgcggtgagc agccggtgga 



22860 

22920 

22980 

23040 

23100 

23160 

23220 

23280 

23340 

23400 

23460 

23520 

23580 

23640. 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 



24780 

24840 

24900 

24960 

25020 

25080 

25140 

25200 

25260 

25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 



26160 
26220 
26280 



9 



a 
w 

Q 
TO 

m 



ctcccccgcc 

ggtgcgcgct 

tggcgaagcc 

cgatccagcg 

ccgcgacgat 

agtcgacggc 

cgacggcgtc 

agacgccgtc 

ccatcgcgcc 

ggcgggcacc 

gggagtgtcc 

acaccatgac 

cgtcgagcat 

gcatcctggc 

gcggtccttg 

cgacgtcgtc 

ccgcggcgat 

ggacctggcc 

tggcgatcag 

ccgggtgggc 

cggcgcgccg 

cgccggccgt 

tgccgtgccg 

tgtggccgat 

aggtggccag 

cctccacggc 

cccgctcctg 

ccgccgaacc 

tctcgacgat 

acgccttgca 

acggcgaggc 

gtgactgccc 

ccgccggacc 

tgccggtcgc 

ccatgaacac 

gcgcctccca 

gcggactgat 

gacgcacggt 

aaccacggtc 

agtcctccgg 

gctcgtcctg 

gcgccgcggt 

cgggcagccg 

agtcgacgcc 

cgagtacggc 

cggagagccg 

cccggcgcgg 

gcgccgggtc 

gcgccgtcac 

gttcccacag 

ccagcgcgtc 

caccggcggc 

gcaggtgcca 

gcgcggtgag 

gcgccgggtc 

cgatcgccgt 

gcagccggcg 

cggagccacc 



gcggcggagc 
gacctcgacg 
tacggggtgg 
ttcgtcggcg 
ccgctggagt 
gatgcggcgc 
cgggcgcccg 
gatccgggcg 
gcgtccggcg 
gtcctccagg 
gatgacggcg 
ggcccagcag 
ggcgatgggg 
ggcgaacacc 
tccggggaag 
gtcgagcagc 
ggcgcgcggg 
gtcgagggcc 
cggctcaccg 
ttccagcagg 
cgggcggtcg 
ccagtcgacg 
catggcgagg 
gttggacttc 
caccgcctgt 
gtccacgtcc 
cgagggcccg 
ccggacaacc 
cagcacaccg 
gcgcgcgtcg 
catcaccgtg 
ggcctggtgc 
ctccagaccg 
gccgaaaccg 
gccggtgtcg 
cgaggtctcc 
cccgaagaac 
cgacgtgccc 
cgtcggaaac 
cgacgcgacc 
ccggacggcc 
gagcttcgcc 
tacgcccgtc 
gagttccttg 
cgcggtgcac 
cgcgatccgg 
tgcgcgcagc 
cgaggaccgc 
gccgtcgcgg 
gccccaggcc 
gaggaacgcg 
cgacgagtag 
cgcggcgtcc 
gacgccgtcg 
gatccccgcc 
gacctcggcg 
cacgccgtgg 
ggtgacgagc 



gtggcgacgg 

aacacggtgt 

cgcatgttgc 

gtggagaacc 

tcgtcgtaca 

acccagacgc 

gcgacggtcg 

gcatccgcct 

agttcgcgca 

gtgagcgctc 

tccgggcgta 

acggggtgca 

tcccagcccg 

ggggaggccg 

acgaagacgg 

acggcgcggt 

tcgtggccgg 

gtggcggtcc 

ggcttcgagg 

acgtgggcgt 

gtctcgggcc 

tgcgaggacg 

accatcttga 

agcgagccca 

gcctcgatgg 

gccggggtga 

ttcggcgccg 

gccagcacac 

gacccctcgg 

ggcgcgagac 

acgccgccga 

agcgccacca 

tagaagtacg 

cccaggtcgg 

cttccgcgca 

aggaccagac 

gccgcgtcga 

ggatgatccg 

gccgtgatcc 

ccacccggca 

gcggtcgtgg 

gcgacggcgc 

gcctcggtga 

aacgtggcgg 

tgccggacga 

tcggcgaggg 

aggggcgagc 

aacgccgcgt 

cgcatgcggg 

acggacaacg 

ttcgcggccg 

aggacgaacg 

accttcgggc 

tcgaggacgg 

agtacggagg 

ccgggcacgt 

cgttcgacga 

acggtgccgt 



tcgcgccgtc 

cacccggctc 

ggaaccagta 

acgggatctc 

gcgggtcgac 

cgcgggcctc 

tggtggtggc 

cgacgtcggc 

ggagcaggag 

cggcgacaca 

cgcccgcggc 

cgacgtcgac 

tgtgcgggat 

ccatcagttc 

tgcgcggctc 

gcgggaacgt 

gacgggcggc 

gcgccgagac 

ccgacggctc 

tggtgccgct 

a999ccgggc 

gcgtgtccac 

tgacaccggc 

gcagcaccgg 

gatcgcccag 

gcccggcgtt 

acaacccgtt 

ggtggccgtt 

cgaaaccggt 

cccgctgctg 

ccagggcgag 

gcgacgacga 

acagccgacc 

tgccgagtcc 

gcgactccgg 

gctgctgcgg 

agtccgccac 

gatcgggatc 

cgtcaccacc 

gccggcaggc 

tgcgggtcgg 

gcggcgtcgg 

aggcgttgcg 

tcgcctcgac 

cggcgagcac 

tggtggcgcc 

tgccgaggcc 

cgaacagcgt 

cgccggtgcc 

cgggcagtcc 

cgtagttgcc 

cggccagttc 

gcagcaccgt 

ccgcggtgtg 

cgagttcgtc 

cgctcgccgt 

ggtggcggct 

ccgggtcgag 



gatcgcgggc 

gcgggcagcg 

ctcgtcgtcg 

99gcgtgcgc 

gaacggggtg 

gtagtcggcg 

gccgttgcgg 

ggccgggagc 

aacgctgcgc 

ggccgcggcg 

ctcccacacg 

gcggcgggtc 

cagcgcgtcg 

gacgcccatg 

ggtgagcgcc 

cgtacgcctg 

gaggtgctcg 

gggcagtggt 

ctcggccggc 

gacgccgaag 

atcggtgagg 

gtgcagggtg 

gacacccgcg 

ggtgtcgcgc 

cctggtgccg 

ggccagggcc 

ggaagcaccg 

gcgctcggca 

gccgtcagcc 

ggagaactcg 

cgagcattcg 

acacgccgtg 

ggacagcaca 

gtacccgtcg 

gaggatcccg 

gtccatcgcc 

cccggcgagg 

gtacagcccg 

cgactccagc 

catccccacg 

cgatgccgtc 

gaagtcgaag 

cagccggatc 

ccgtgcggca 

gtccttttcg 

99ccgcccgg 

ggccgggtcg 

cagtccgcct 

gaccgtcagc 

ggctgcccgg 

ctgtccgggg 

cgtgtcctgg 

ctcgagccgg 

cacgacggcc 

ccggtcggcg 

gccgctgcgc 

gatgatgccg 

cgccggagcg 



agcagcacgg 

gtcacggccg 

agcggcgcgt 

gaggtggtgt 

tgggtcgggc 

atcagcgttt 

cccgcgaccc 

gcgaccgagc 

agcgcgacga 

atctcgccct 

gcggccagcg 

acctccgggt 

gcgcattggc 

ccgcgccact 

gtgccggtga 

gcgagcaggc 

cggagtcggc 

gtgagcggcg 

ggctccccgg 

gaggacacac 

agttcgacgg 

cgcggcaggg 

gcggcctgag 

ccctgcccgt 

gtgccgtgcg 

tgccggatca 

tcctggttga 

tcggagagcc 

gcatccgcga 

acgaagccgg 

ccggagcgca 

tcgaccgtga 

ctggtctggg 

gagaaggcgc 

gcgtgttcca 

agcgcctcac 

aagccaccat 

tccacgtccc 

agccgccaca 

atcgccaacg 

cggccggaca 

accgcggtgg 

gccatgagcg 

ccgtcgtggc 

gcgtccgcgg 

cgccgcggct 

gcggcgacca 

tcggcggtca 

ccgctctccg 

cgctgttcgg 

ctgccgagca 

gtgagttcgt 

tcgggggtga 

gtgagcgggt 

acgtcgcagg 

gacagcatca 

gccagcgtcc 

tcacccgccg 



26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 

28320 

28380 

28440 

28500 

28560 

28620 

28680 

28740 

28800 

28860 

28920 

28980 

29040 

29100 

29160 

29220 

29280 

29340 

29400 

29460 

29520 

29580 

29640 

29700 

29760 
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ggaccgccgg ggccagacgg cgggcgtaca cctggccgtc acgcagcacc acctggggct 29820 

catcgagcgc ggtggccgct gcgagcagcg gctcggcggt gtccggggcg gcgtcgacga 2 9880 

ggacgatccg gccggggtgt tcggcctgcg cggtccgcac cagtccggcg gccgcggccg 2 9940 

acgcgagacc gggcccggtg tggacggcca ggaccgcgtc ggcgtaccgg tcgtcggtga 3 0000 

ggaagcgctg cacggcggtc aggacgccgg cgcccagttc gcgggtgtcg tcgagcgggg 30060 

caccgccgcc gccgtgcgcg gggaggatca ccacgtccgg gaccgtcggg tcgtcgaggc 30120 

ggccggtcgt cgcggtcgtg ggcggcagct ccgggagctc ggccagcacc gggcgcagca 3 0180 

ggcccggaac ggctcccgtg atcgtcaggg ggcgcctgcg cacggcgccg atggtggcga 3 024 0 

cgggcccgcc ggtctcgtcc gcgaggtgta cgccgtcagc ggtgacggcg acgcgtaccg 30300 

ccgtggcgcc ggtggcgtgg acgcggacgt cgtcgaacgc gtacggaagg tggtcccctt 3 0360 

ccgcggcgag gcggagtgcg gcgccgagca gcgccgggtg caggccgtac cgtccggcgt 30420 

cggcgagctg tccgtcggcg agggccactt ccgcccagac ggcgtcgtcg tcggcccaga 3 0480 

cggcgcgcgg gcggggcagc gcgggcccgt ccgtgtaccc ggctcgggcc agacggtcgg 3 054 0 

cgatgtcgtc ggggtccacc ggccgggccg tggcgggcgg ccacgtcgac ggcatctccc 3 0600 

gcacggccgg ggccgtccgc gggtcggggg cgaggattcc gtgcgcgtgc tcggtccact 30660 

cccccgccgc gtgccgcgtg tgcacggtga ccgcgcggcg gccgtccgcc ccgggcgcgc 3 072 0 

tcaccgtgac ggagagcgcg agcgcaccgg accgcggcag cgtgaggggg gtgtccacgg 3 0780 

tgaacgtgtc gagggcgccg cagccggctt cgtcgcccgc ccggatcgcc agatccagga 3 084 0 

gggccgcggc gggcagcacc gcgaggccgt gcagggagtg cgccagcgga tcggcggcgt 3 0900 

cgacccggcc ggtgagcacc aggtcgccgg tgccgggcag ggtgaccgcc gcggtcagcg 3 0960 

ccgggtgcgc gaccggcgtc tgtccggccg gggccgcgtc gcccgcggtc tgggtgccga 31020 

gccagtagcg gacccgctcg aacgggtacg tcggcgggtg cgaggcgcgt gccggcgcgg 31080 

ggtcgatgac cttcggccag tcgaccgtga cgccgtcggt gtgcagccgg gcgagcgcgg 3114 0 

tcagggcgga tcgcggttcg tcgtcggcgt gcagcatcgg gatgccgtcg acgagtcggg 31200 

tcaggctccg gtccgggccg atctccagga gcaccgcccc gtcgtgcgcg gcgacctgtt 31260 

ccccgaaccg gacggtgtcg cggacctgtc gtacccagta ctccggcgtg gtgcaggcgg 31320 

cgcccgcggc catcgggatc ctcggctcgt ggtacgtcag gctctccgcg accttgcgga 31380 

actcctcgag catcggctcc atccgcgccg agtggaacgc gtggctggtc cgcaggcggg 3144 0 

tgaagcggec gagccgggcc gcgacgtcga gcaccgcctc ctcgtcaccg gagagcacga 31500 

^ tcgacgcggg cccgttgacc gcggcgatct ccacgccgtc ccgcagcagc ggcagcgcgt 31560 

Q cccgttccga cgcgatcacg gcggccatcg ccccgccgga cggcagcgcc tgcatcaggc 31620 

33 gggcccgtgc ggacaccagc ctgcacgcgt cctccaggga ccagacgccg gcgacgtacg 31680 

fU cggcggccag ctcgccgatc gaatggccca cgaaggcgtc cgggcgtacg ccccacgcct 31740 

SI cgagctgtgc gccgagtgcg acctggagcg cgaacaccgc gggctgggcg tacccggtgt 31800 

Q cgtggaggtc gagcccggcg ggcacgtcga gggcgtccag cacctcgcgg cgagtgcggg 31860 

1^ cgaagacgtc gtaggcggcg gccagtccgt cgcccatgcc gggacgttgt gagccctgtc 31920 

cggagaagag ccacacgagg cggcggtccg gttctgcggc gccggtgacc gtgtcggtgc 31980 

cgatcagcgc ggcccggtgc gggaaggccg tgcgggcgag cagggccgcg gccaccgcgc 32 040 

gctcgtcctc ctcgccggtg gcgaggtggg cgcgcaggcg gtgtacctgt gcgtcgagtg 32100 

cctgcggggt gcgtgccgag agcagcaggg gcagcggtcc ggtgtcgggt gccggggcgg 32160 

gttcgggggc cggtcggggg tggctttcga ggatgatgtg agcgttggtg ccgctaacgc 32220 

cgaaggagga caccccggcg cgccgtgggc ggtcggtttc gggccagggg cgggcgtcgg 32280 

tgaggagttc gacggcgccg gccgtccagt cgacgtgcga ggacggcgtg tccacgtgca 32340 

gggtgcgcgg cagggtgccg tgccgcatgg cgaggaccat cttgatgaca ccggcgacgc 32400 

ccgcggcggc ctgagtgtgg ccgatgttgg acttcagcga gcccagcagc accggggtgt 32460 

cgcgatgctg cccgtaggtg gccagtaccg cctgcgcctc gatggggtcg cccagcctgg 32520 

tcccggtgcc atgcgcctcg acagcgtcca catccgccgg ggtgagcccg gcgttggcca 32580 

gcgcctgccg gatcacccgc tcctgcgacg gcccgttcgg cgccgacaac ccgttggaag 32640 

caccgtcctg gttgaccgcc gaaccacgca cgaccgccag gacattgtgg ccgtgccgct 32700 

cggcgtcgga gagcctctcg acgatcagca caccggatcc ctcggcgaaa ccggtgccat 32760 

cagccgcatc cgcgaacgcc ttgcagcggc cgtccgggga gaggccccgc tgctgggaga 32 820 

agtccacgaa gccggacggc gaggccatca ccgtgacgcc gccgaccacg gcgagcgagc 32880 

actcccccga gcgcagcgac tgcccggcct ggtgcagcgc caccagcgac gacgaacacg 32940 

ccgtgtccac cgtgaccgcc ggaccctcca aaccgtagaa gtacgacagc cgaccggaca 33 000 

gcacactggt ctgggtgctg gtggcaccga aaccgccgcg gtcggctcca gtgccgtacc 33 060 

cgtagaagta gccgcccatg aacacgccgg tgtcgcttcc gcgcagcgac tccgggagga 33120 

tcccggcgtg ttccagcgcc tcccacgagg tctccaggac cagacgctgc tgcgggtcca 33180 

tcgccagcgc ctcacgcgga ctgatcccga agaacgccgc gtcgaagtcc gccaccccgg 33240 
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cgaggaagcc 

gcccgtccac 

ccagcagccg 

ccacgatcgc 

tggcccgcgc 

cgaagacgag 

cgacgccggt 

gggcgtcgcg 

gcgcggccgg 

ggacccggtc 

ggtcggtgtg 

tgccgttgcg 

ccgcgtccca 

gggcgagcgc 

acgtggcgga 

cgtgcaggtg 

gcatggtcgt 

gctgggcgac 

cgtaccgcac 

cgacctcggc 

cggtgccgcc 

cgacacggcg 

cgccggcggc 

cgacgcggcc 

cgggatcgcc 

gccaggtctg 

^9gtgcccgg 

gcacgtcggc 

tctcgaacag 

ggaccggtga 

ccagcagcac 

acgccagccg 

cgagcagcac 

acgcgtaggc 

acgagagcgg 

gccagtccac 

gcgcccaggg 

cggttccgac 

cgatggtcag 

ccacgagcgc 

gctgacggcg 

cccacgagcc 

ggtcacggcg 

tgacgggcac 

cctcgcctcg 

ccagtgcggt 

ccgccaggtg 

aggcggcgtc 

ccggcgtgcg 

catgcgcggt 

gcagctcctc 

cggcgacctc 

ccatgccgcc 

tcgcggcgtc 

agtggccgac 

ccatcaccgc 

gccgctgggc 

actcgcggag 



accatgacgc 

gtcccaacca 

ccacaagtcc 

caacggctcg 

gccggccagt 

cgtagcgggc 

cagcgagtcg 

gtggccgagc 

aggtgcggac 

ggacgcggcg 

cagggccgcg 

ggcgatgcgg 

cagtccccag 

gtcgaggaac 

tatggacgag 

ccaggcgacg 

cacggccgcg 

gtcggcgacg 

gcggtcgtcc 

ggcctcgtgc 

ggtgacgagg 

cagacgggcc 

gagcccggcc 

gggatgctcc 

ggtacgggtg 

cacggtggtg 

gtcgccgggt 

gaggtacgtc 

cgcctcggca 

gccgtgctcg 

gcgcagcgcg 

gcgccgctcc 

ggggtgcagc 

gcggccctcc 

cagcgcgtcg 

gggctccgcc 

gcccgtgccg 

ggtggcctgg 

ctccgcgatc 

cgagccgggc 

taccgagaca 

gagcagcggg 

gaacgggtac 

gccccggacc 

ccgcagtgtg 

ggtgagcacg 

gccggtcgcg 

cgcgggccgg 

cggagtgatg 

gtgcgacgcg 

cacggcgtcg 

caggcgcccg 

ctgcccggcc 

gtccagggtg 

gaccgcggcc 

gaacgacgcg 

gatgacgtcc 

ccgccgggcg 



acggtcgacg 

cggtccgtcg 

tccggcgacg 

tcctgccgga 

tcgtccaggt 

agcgtcaggc 

aagcccactt 

accgcggcag 

gtgcgccgga 

acggcggcga 

tcgaacaggg 

gccaggtcgg 

gcgagcgaga 

gcgttgccgg 

tacaggacga 

tccgccttga 

tcgtcgacga 

actgcggcca 

tccggcgtgt 

acggtgagca 

acggtcccgc 

gcacgcgctg 

gctatggcgg 

gtctccgccg 

gccacgatga 

agcaggtcgc 

tccacggcca 

cagtcgggga 

tcggggtcgg 

tccgtggcga 

gtcgcggcgc 

gggtccgtga 

ccgtaccggg 

cccgtccaca 

tagaagccgg 

ggaccgccag 

gtacggctgt 

atctccgtgt 

tccggcgtgc 

acgatgaccc 

ccgcggtggc 

tggccggacg 

gtgggcagcg 

cagagcgcgg 

ccggtgacga 

99atgcgcgc 

gcggcgaacc 

tccagccacg 

ccggcgagag 

tagtcgacgg 

gccgcaccgg 

gcccacacgg 

agttcggtgg 

agcaccccgg 

ggggcgaccc 

ggctgcacga 
agcaggtccc 
aacacgggct 



tgcccggatg 

gaaacgccgt 

cgaccccacc 

cggccgcggt 

gggcggcgag 

ccgtcgcgtc 

ccctgaacgc 

cgctggtacg 

cggccggcac 

ggtcgagccg 

cgagcccctg 

tggcggtcag 

cggcgggcag 

tcgcgtagtt 

acgcggccag 

cccgcagcac 

tcccggccat 

gctcgtcgcg 

cgccgggccg 

ggtggtccac 

cggtcagcgg 

tgccgtcggc 

cgggcgtgat 

tccggaccag 

gccgggatcg 

ggcccagctc 

ggaccacgac 

cgggtgacgc 

cggcccgcac 

cgatgcggac 

gcgcgtggat 

agaccgtccc 

cgtcggtgag 

tcgcggtcat 

tcaggtcggc 

tgtccacgct 

gcagactcac 

cgccgtcgcc 

cgagccgggc 

ggccgtccac 

cagcgcgccc 

ttcccgccgg 

gcaccacccg 

cgagcgaccg 

ccgtatgcgc 

tgacctcgac 

gaacggtgcg 

cctcgtccac 

cgtcgagcag 

cgatccggcg 

cgacaacgat 

cggcgtcgaa 

cgacgagtcg 

cgacgcaggc 

cgtgcgcacg 

catcgacccg 

atccggtgtg 

cggtggcgag 



atccggatcg 

gatcccgtca 

cggcagccgg 

cggggtacgc 

cgcctgcgcc 

ggccagccgg 

gcgcgcgggt 

gacgaggtcg 

gagggtgcgt 

gatcggcacg 

tgcggccgtc 

ccgcccgccc 

cccctggtgg 

ggcctgaccc 

gtcgagatcg 

ggcgtcccac 

gtgcacgacg 

gtcgacgacg 

gccgttgcgg 

gaggaggcgg 

ggaggttccg 

gacccggacg 

ctcgtccgct 

gccgccgagc 

cgcccagcgc 

ccgggtccgg 

cggggggtgc 

gggcacgggc 

ggtcaggctg 

catgtcgggg 

cctcacgccg 

gagggcgtgc 

ctgttcggcg 

ggcccggaac 

cgggtcggcg 

cagcgctccg 

cgaccgccgt 

gtcgaccacc 

tcccgcttcg 

ctcgtggtcg 

tcgccgtcgg 

ttccgcgtcg 

acgcgtcgcg 

agtgaagcgg 

atgcccggcg 

gaacgcgcgg 

gcgcaggttg 

ggtggagaag 

cgcgccgcgg 

ggcgcggggg 

cgacgcgggt 

gtcggcgggc 

gctgcgcacc 

cgcggcgact 

ccacagctcc 

gtcgaacgcg 

cggggcgagc 

cagttcggca 



ggatcgtaca 

ccacccgact 

caggccatcc 

cgccgggtgg 

gtggggtggt 

ttgcgcagtt 

gcgatggcgt 

agcatgtcgc 

aggaccggcg 

agcgcgggcc 

atcggggtca 

atcccgtccg 

tgccggtggc 

gcgccgccga 

cgcgtcagct 

tgctccggcc 

gcgcgcagcc 

tcggcggcca 

gacaccacga 

ccgagcccgc 

gtggccgcgg 

tgcggctcgt 

tcgatcaggg 

gcttcctgcg 

ggctcggcga 

gcgccgggcg 

tcgccgtcgg 

acccaggcga 

tcgacgtcaa 

ccgacgcgtt 

gaccaggaga 

agggccgcgt 

aggcggaccg 

gcgggcccgt 

tcggcgggcg 

gtcgcactga 

ccggacacct 

accggcgcga 

gcgagcagtt 

gcgagccagg 

gcgaggtcga 

atccagtagc 

aacgaccagg 

tccaggccgc 

agcgtgtcct 

tatccgcggt 

tcgtaccagt 

aacgggacgt 

atcgtttcga 

gtggcggcca 

ccgttgaccg 

ggcaccgaga 

gcgacgacct 

tcgccctggg 

gccagcgcca 

ggcgctccgg 

gccgtggcgc 

cccatgccgg 



33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

34500 

34560 

34620 

34680 

34740 

34800 

34860 

34920 

34980 

35040 

35100 

35160 

35220 

35280 

35340 

35400 

35460 

35520 

35580 

35640 

35700 

35760 

35820 

35880 

35940 

36000 

36060 

36120 

36180 

36240 

36300 

36360 

36420 

36480 

36540 

36600 

36660 

36720 
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cccactggga gccctgcccg gggaacgcga acacgacacg tgtgtcggtg acgtcggcgg 36780 

ttcccg?clc Igcccccggc acttcggcac cacgggcgaa cgcctccgcc tctcgggccg 36840 

pcacglccgc ccggtggcgc atggccgtcc gggtggtggc gagcgagtgg ccgaccgcgg 36900 

ccgc£cg?c ag?|a|?gig gccagctgtc ccgcgacgtc ccgcagtccc tccggggtcc 36960 

gglc?|acat cSgccagacc acgtcctcgg gcaccggctc ggcttcgggt gcggacacgg 37020 

gtgcgggcgc ggcggggggc ccggcctcca ggacgacatg ggcgttggtg ccgctgatgc 37080 

cglaclacia Scacccgca cgccgggcgc gcccggtgac cggccacggc tcactgcggt 37140 

acScagclg iatgtcgccg tcccagtcga cgtgccggga cggctcgtcg acgtgcagcg 37200 

?acqcggca| Iac|ccitgl cgcatcgcca tgaccatctt gatgacgccg gcgacgccgg 37260 

ccgSccS Igtltglccg aJgttcgact tgagcgagcc gatcagcagc gs-tgcacgc 37320 

gt?c|cgccl ILigccact tgcagggcct gggcctcgac ggggtcgccg -3--99gtgc 37380 

Iggtlccgtg tgcctccacg gcgtcgacgt cacccggcgc -aggccggcg ^^S^^^agcg 37440 

cacqctggat gacgcgctgc tgcgcaggcc cgttcggggc ggacagcccg ttcgacgcgc J/buu 

cgtcggStt laclgcggig ccgcgcacca gcgccagcac ggggtggccg tggcgggtgg 37560 

cScllalag ccgc^cSagc accaggacac cggcgccctc ggcgaagctc gtgccgtccg 37620 

cIgStccgc galggccttg gcacggccgt cgggggcgag cccgcgctgc cgggagaact 37680 

cScqaaccc ggtcgtcgtc gccatcaccg tgacaccgcc gaccagggcg agcgagcact 37740 

cccccgagcg clgcIacJgc gcggcctggt gcagcgccac cagcgacgac gaacacgccg 37800 

totcgaSgt gaccgacggg ccctccagac cgaagtagta cgagagccgc ccggagagaa 37860 

clctggtSg cgtgccgi?! gccccgaaac cgcccaggtc cacgcccgcg ccgtagccct 37920 

a gggtSacg glclatgaat acgccggtgt cgctgccgcg gacgctttcg gg^aggatgc 37980 

^£ cc|c?cgt?c iaacgcctcc cacgacgctt cgaggaccag acgctgctgc ggg^ccatcg 38040 

^ ccIqcqJctc Icgcgggctg atcccgaaga acgcggcgtc gaagtcggcg gcgccggtga 38100 

J ggSg?cgcc gt|acgaci gaaaccttgc cgaccgcgtc ggggttcggg tcgtagagcg 38160 

Q Sgciaggtc ccagccgcgg tcggcgggga actcggtgat cgcgtccccg ^cggagtcga 38220 

W cSgcclcca caggtcctcc ggtgaccgca cgccaccggg catccggcac gccatggcca 38280 

S cgScScag cggctcgttc cccgccaccg tcggtgcggg cactgtcgcc gc^ggagcgg 38340 

; cSggSccgl c?laccccgc cgttcctcat ccaggcgggc ggcgagcgcg g^^ggtgtcg 38400 

^' qgtStcgS gacggccgtc gcggagagcc gtacccccgt cgtctcggcg ^ggctgttgc 38460 

" Salccglac LciStgagc gagtcgatgc cgaggtcctt gaacgccgtc gtgggcgtga 38520 

^ Stcggalgc gtclgcgtgg ccgagcacgg cggccgtggc cgcacacacg atggccagca 38580 

^ lltalcallc Lggtcgcgg tcgcggtcgc ggttgtcctc cgcacgggcg gcgatgcggc 38640 

rU |ltcgg?ccg ct^ccglall ggctcggtgg gaatcgccgc gaccatgaac ggcacgtccg 38700 

---4 cggcSggct cgcgtcgatg aagtgggtgc cctcggcctc ggtgagcggc ^ggaacccgt 38760 

n cScQcacccg ctgccggtcg gcgtcgtcaa gttgtccggt gagggtgctg gtggtgtgcc 38820 

C ac^lgc^cc! ggcgaggal Itggcgggtt ggccgagggt gtggcggtgg J^g^^aggg 388 0 

catcqaqqaa ggcgttggcg gcggcgtagt ttccttgtcc ggggctgccg aggacggcgg JHS^u 

c|gc|c?|ga iL|agSc| Sagtgggtga ggggttggtt ttgggtgagg tggtgcaggt 39000 

gcSIgclic ittggctttg gggtggagga cggtggtgag gcggtcgggg g^gjgggcgt 39060 
cgaglltlcc Itcgtcgagg gtggcggcgg tgtggaagac ggcggtgagg gg^gg^^^^ 

tgtgggcgag ggtggtggcg agttggtggg ggtcgccgac gtcgcagggg ^gg^ggg^^^ ^!^°° 

cIgStIg? Itcigggggt ggggtgcggg agaggaggta ggtgtggggg tggttcaggt 39240 

ggciggcgag gatgccggcg agggtgccgg agccgccggt gatgatgatg g^g^g^cgg 39300 

||t?|lgggl Sgtggtggtg ggtggggtgg tggtgtggag gggggtgagg tggggtcggt 39360 

Saglg?|?| Pgggtgagg cggaggtggg ggtggtcgag ggtggcgagt tgggccaggg 39420 

qqaqqqgagl gtgggggtgg tcggtttcga tgaggcggat gcggtggggg tgttcgttct 3 9480 

SqcqScg IgtlaScS gtgacggtgg cgccggcggg gtcggtggtg gtgtggacga 39540 

?qlqqgt|t| Itclg^lgtl gtgaggtggt gttgcagggc ggtcaggacg cgggtggcgc 39600 

qqgStSgc Icglitiig? Stgtcctcgg ggtcgtcggg gtgggcggcg gtgatcagga 39660 

lltalclclc iggcIggtL ccgtcgtaga ccgcctcggc gaccgcgagc cactccaacc 39720 

glalcSgtt cSccSgac gglgtgtcgg cccgctccct cagcaccagc gagtccaccg 39780 

IScgacagg acggccatcc gggtcggcca cgcgcacggc gacgccggcc tccccccggg 39840 

tgaglgcgac gcgcaccgcg gcggccccgg tggcgttcag gcgcacgccc g^^caggaga 39900 

a?glSgctc laLccgccg cccgcgtcga ggcgcccggc gtgcagggcc g^g^cgagca 39960 

gtgccglatg cacacc|aaa ccgtccgcct cggcggcctg ctcgtcgggc agcgccacct 40020 

cqqcaScac ggtgtcacca tcacgccagg cagcccgcaa cccctggaac gccgacccgt 40080 

a??cataacc Scltcccgc agttcgtcat agaaccccga gacgtcgacg gccgcggccg 40140 

tggccggcgg ccactgcgig alcggctcac cggaagcgtt ggaggtatcc ggggtgtcgg 40200 
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gggtcagggt 

cggtcaccgg 

ctgcggtcac 

cggtctcgtc 

tgccccgcac 

gaacaacacc 

ccgccccggt 

gcctgtgctc 

ccgtgcccca 

gctcccagcc 

gcagcagcac 

ccgcatccag 

cggtcaccca 

ttcccttcag 

cgtagtcgac 

cctccaccgc 

tccacacacc 

cccggccggc 

cctccaggct 

ccacagcgtc 

cccagctggc 

cccgcacatc 

cgaacaccgc 

cggggaagac 

ccagcagcac 

cggccacatc 

gactcacctc 

cacgcgacgg 

acgacgacac 

gcagctccac 

tcttcggcgc 

cagccgcctg 

ggtcctgccc 

cggtgccgtg 

cctgccggat 

cgtcctggtt 

cgtcggagag 

ccgcgtcggc 

ccacgagctc 

ccccggcccg 

tgtcgaccgt 

cgctcgtctg 

ggttgaacgc 

cggcgttctc 

ccagcgcctc 

atccgccgtg 

cgacgtccca 

gccgccacag 

tcgcgacggg 

cggcgaggtg 

cccgcagacc 

ggccgttctc 

cggtggcgac 

cggcgaggcg 

cggtgaggat 

tccgggccac 

gcgccggccg 

cccgtggccg 



gccgctggcg 
ccgccgtccg 
cggcaccacg 
accggcccgg 
cgcgtgatca 
accaccgtcg 
cagcccggcc 
gaacgcgtag 
gtccaccccc 
accgtcacca 
cggatgggca 
cgcgacaggg 
ggcgctgtcc 
tacctcagcg 
cgcgatacga 
cgacgggtcc 
ctcgaccaga 
cagccgcgcc 

gsgggctccg 

cggcacgacc 
cggctggacc 
ccagcccgtg 
ggaacggtcc 
gaacaccgta 
cgcacggtga 
caccccaccc 
accacgagcc 
cccaggaaca 
acccgcatgc 
cgcaccggcc 
gatcccatgc 
cgcatgaccg 
gtaggccgcg 
cgcctccacc 
cacgcgctgc 
caccgccgag 
ccgctccagc 
gaacgccttg 
tgcggtgttc 
cagtgcctgt 
gaccgccggg 
cgtcgccgtg 
gcccatgaac 
gaacgcctcc 
gttcggactg 
gcgtgtcgtg 
gccccggtcg 
gtcctccggc 
gtcgccggag 
ggcggcgaac 
cgtccgcgcg 
gcggaacgtg 
gctgtcgcgg 
gttcgcccac 
cggcggcgtg 
gatgtacgag 
ttcgatgccg 
ggtgtgggcg 



tgccgggtcc 

gcctcatcgg 

agcggggatt 

atgaccagct 

gccagccagg 

tcggcgggca 

gcggacaggt 

gtgggcagat 

gcacccagag 

gtccgcaacg 

ctgcactcca 

cgacgcaggt 

acggtcgacc 

agttcgtcct 

cgcacccgca 

cccgccacca 

cccacctcac 

gcgatcaccc 

gccacacacg 

ccatgcgcct 

acctccaccc 

tgcggcaaca 

atgagttcca 

cgcggctgat 

ccgaagacag 

ccgcgcagat 

gacaccggca 

ccctccagga 

ggtgcccgat 

gaccagtcca 

cgcatcgcca 

atgttcgact 

aggacggcct 

acgtccacat 

tgggcgacgc 

ccgcggacga 

acgagaacgc 

caccgtccgt 

gccatgacgg 

gccgcctggt 

ccctgaagtc 

acaccgagcc 

acgccggtgt 

caggaggtct 

atgccgaaga 

gagcggccgg 

gtggggaact 

gaggcgaccc 

ccgagggtct 

gcacgcggag 

gcgacggtgt 

cggtccgggg 

accaggtcga 

tcctgttccg 

gcgcccgcca 

ccgccgcccg 

ggcagcgcgc 

tcggcgccgg 



agctgcccgt 

ccccttccac 

cgatgaccag 

ccacaaacgc 

gatgcgtacg 

gtgctgtgac 

cggtggcacc 

ccagcagccg 

tccacgcctg 

acgccaccgt 

cgaacaccga 

tccggtacca 

accacgccac 

cgatggcctc 

ccccatcagc 

ccgtcgaagc 

cggccggcaa 

gactgcgcaa 

ccgccgcgat 

gccacagcgc 

gctccgccac 

acgcccgcgc 

cgcccatgcc 

ccaccgccac 

cacgctcacg 

acccctccag 

acggcaccaa 

tcacgtgcgc 

ccgactcggg 

catgcgacga 

tgaccatctt 

tgaccgaacc 

gcgcctcgat 

cggcggcgcg 

cgttgggggc 

ccgcgagaac 

cgacgccctc 

ccggggagag 

tgacaccgcc 

gcagggcgac 

cgtacacgta 

cgcccaggtc 

cgctctcccg 

ccaggatcag 

acgcggcgtc 

ccgcgtccgg 

cggtgatcgc 

cgccgggcag 

gggcggtcgc 

tggggtggtc 

tggtgaactc 

agcagtgtcc 

gcagtacgtc 

tggcgtcggg 

tcgtcgcggc 

cgatggcctt 

ggacggtgac 

ccgggccgtc 



gccctcggta 

ggtcaccgac 

ttcatccacc 

cgtacccggc 

caatgagatc 

ggcggccagc 

ggccgcctcc 

ccccggcacc 

cgccaacgcc 

gcgggcctgt 

cccgtccagc 

gtacccctca 

cgacccggtc 

cacgtgaggc 

ctcataccgc 

cggaccatta 

cgccaccgaa 

cgccaccacg 

ctccccctgc 

ggccaggctc 

atccgaccgc 

acactcctcc 

cacccactgg 

acccatcacc 

caccaacccc 

ccgctccacc 

cccatcacca 

gttcgtaccg 

ccacggcctc 

cggctcgtcc 

gatgacaccg 

gaggtagagc 

cgggtcgccc 

cagtccggcg 

ggacagtccg 

ggtgtgcccg 

9gcgaagccg 

tccgcgctgc 

gaccagcgcc 

cagcgacgac 

cgagaggcgc 

ccggccgacg 

gagcctgtcc 

gcgctgctgg 

gaacccggcg 

gtccgggtcg 

ctcggtaccg 

tcggcacgcc 

gggtgccgct. 

gaacgcggtt 

gacggtggtg 

ggcgcccggc 

ctcccggccc 

ctcggccggt 

ccgcgccccg 

ctcgatcagg 

ggtggggagt 

gagcaggacg 



cgcgcgtgga 

acatccaccg 

accccgcaac 

agcagaaccg 

cggccggtga 

atcggatgcg 

agccagtacc 

ggttcgacca 

cccagccacc 

tccatcgccg 

tccgccaccg 

tccaccggct 

ccgccggaaa 

gtgtgggagg 

gccaccacct 

cgcgccgcga 

gccatcgccc 

cgggcggcgt 

gagtgtccga 

accgcgaccg 

gacaacatct 

atacgagccg 

gcaccctgcc 

cgggcatcac 

tgcgcgaccg 

tgcccccgca 

cccgactcca 

ctcaccccga 

gcctcggtga 

acgtgcagcg 

gcgacacccg 

ggcgtgtcgc 

agccgcgtgc 

ttgaccaacg 

ttggaggcac 

ttgcgctcgg 

gtcccgtccg 

cgggagaact 

agggagcact 

gagcacgccg 

ccggacagga 

ccgtagccct 

ggcacgatgc 

gggtccatcg 

ccggccagga 

tacagcgcgt 

gcggcgacga 

atgccgacga 

gtcgcggagc 

gacgcgggca 

agcgagtcga 

aggcccagga 

gcacgggccg 

ccggtcagtg 

gcggaaccgg 

tcgccggtga 

ccctccgcgg 

tgcacgagcg 



40260 

40320 

40380 

40440 

40500 

40560 

40620 

40680 

40740 

40800 

40860 

40920 

40980 

41040 

41100 

41160 

41220 

41280 

41340 

41400 

41460 

41520 

41580 

41640 

41700 

41760 

41820 

41880 

41940 

42000 

42060 

42120 

42180 

42240 

42300 

42360 

42420 

42430 

42540 

42600 

42660 

42720 

42780 

42840 

42900 

42960 

43020 

43080 

43140 

43200 

43260 

43320 

43380 

43440 

43500 

43560 

43620 

43680 
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43740 
43800 



1^ 



43920 
43980 
44040 
44100 
44160 
44220 
44280 
44340 
44400 
44460 
44520 



cgccggggtt cgcggcttcc tcggctgcgg tggtcacgtg ggtgaggccg gtctcgtcgc 
ggagcaggcc ggcgacggtg tcggcgtcct ccccggtgac caggaccggc gcgtccgggc 

cgatcggagg cggcacggtg aggaccatct tgccggtgtg ccgggcgtgg ctcatccacg 43 860 

cgaacgcgtc ccgcgcacgg cggatgtccc acggctgcac cggcagcggg cacagctcac A^qon 
cgcggtcgaa caggtcgagg agcagttcga ggatctcccg caggcgcgcg ggatccacgt 
cggccaggtc gaacggctgc tgggcggcgt ggcggatgtc ggtcttgccc atctcgacga 
accggccgcc cggtgcgagc aggccgatgg acgcgtcgag gagttcaccg gtgagcgagt 
tgagcacgac gtcgaccggc gggaaggtgt cggcgaacgc ggcgctgcgg gagttcgcca 
catggtcggt gtcgaagccg tcggcgtgca gcaggtgttg tttggcggga ctggcggtgg 
cgtacacctc ggcgccgagg tggcgggcga tccgggtcgc cgccatgccg acaccgcccg 
tcgcggcgtg gaccaggacc ttctggccgg gtcgcagctc gcccgcgtcg acgaggccgt 
accaggcggt ggcgaacacg atgggcacgg acgcggcgat ggggaacgac catccccgtg 
ggatccgtgc gaccagccgc cggtccgcga ccacgctgcg ccggaacgcg tcctgcacga 
gaccgaacac gcggtcgccg ggggccaggt cgtcgacgcc gggtccgact tcggtcacga 

tgcccgcggc ctccccgccc atctcgccct cgcccgggta ggtgccgagc gcgatcagca 44580 

cgtcgcggaa gttcagcccc gcggcgcgga cgtcgatgcg gacctcgccg gcggccaggg 44 64 0 

gcgcggcggg acgtcgagcg gggcgacgac gaggtcgcgg agcgttccgg aggcgggcgg 44700 

gcgcagcgcc cactggcgcg gtcggcaggg gggtggtgtc cgcgcgtacc agccggggca 44 76 0 

cgtaggccac gccggcccgc agcgcgatct ggggttcgcc gagcgaggcc gcggcgggga 44 82 0 

cgaggtcgtc atcgccgtcc gtgtccacca gcacgaacga tccgggttcg gcggcctggc 44880 

O ggcgcagcgc ctcgtcccag agccgggcct ggtccgcgtc cgggatctcg gccgggccga 44 94 0 

^ cgcccaccgc gcggcgggtg acgaccgtcc ggcggggtga cggggtgccg ggcaggtcgc 4 5000 

Ji gccgctccca gaccagttcg cacagcgtgg cctcgccact gccggtggcg accagatggg 45060 

S ccggcagccc cgcgagccgc gcgcgctgga ccttgcccga cgcggtgcgg gggatcgtgg 45120 

tgacgtgcca gatctcgtcg ggcaccttga agtaggcgag ccggcggcgg cactcggcga 45180 

r'l ggatcgcctc ggcggggacg cgggggccgt cggaaacgac gtagagcacg ggtatgtcgc 45240 

cgaggacggg gtgcgggcgg cccgccgcgg cggcgtcccg gacaccggcc acctcctggg 4 53 00 

^ cgacggtctc gatctcccgg gggtggatgt tctccccgcc gcggatgatc agctccttga 453 60 

cccggccggt gatcgtcacg tgtccggtct cggcctgacg tgcgaggtcc ccggtgcggt 4 5420 

^ accagccgtc cacgagcacc tgggcggtcg cctccggctg ggcgtggtag ccgagcatga 45480 

O ggctcggccc gctcgcccac agctcgccct cctcgccggg tgccacgtcg gcgccggaca 45540 

'iQ ccgggtcgac gaaccgcagc gacaggcccg gcacgggcag cccgcacgag ccgggaaccc 4560 0 

rU gcgcatcctc cagggtgttg gcggtgagcg agccggtcgt ctcggtgcag ccgtacgtgt 45660 

K\ cgagcagggg cacgccgaac gtcgcctcga aatccctggt gagcgacgcc ggcgaggtgg 4 572 0 

atccggcgac cagcgccacg cgcagcgcgc gagcccgcgg ctcgccggac acggcgccga 45780 

ggaggtagcg gtacatcgtc ggcacgccga cgagcacggt gctggagtgt tcggccaggg 45840 

cgtcgaggac gtcacgcgcg acgaagccgc ccaggatacg ggcggacgcg ccgaccgtga 45900 

ggacggcgag caggcagagg tggtggccga ggctgtggaa cagcggggcg ggccagagca 45960 

gttcgtcgtc ctcggtcagc cgccaggacg gcacgtcgca gtgcatcgcg gaccacaggc Af;n9n 
cgctgcgctg tgcggaaacc acgcccttgg gacggccggt ggtgccggag gtgtagagca 
tccaggcggg ttcgtccagg ccgaggtcgt cgcggggcgg gcacggcggc tcggtcccgg 
cgaggtcctc gtaggagacg cagtccggtg cccggcgccc gacgagcacg acggtggcgt 
cggtgccggt gcggcgcacc tggtcgaggt gggtttcgtc ggtgaccagc acggtcgcgc 
cggagtccgt caggaagtgg gcgagttcgg cgtcggcggc gtccgggttg agcgggacgg 
cgacggcggc ggcgcgggcg gcggcgaggt agacctcgat ggtctcgatc cggttgccga 

gcagcatcgc gacccggtcg ccgcggtcga cgccggacgc ggcgaggtgt ccggcgagcc 46440 

ggccggcccg gagccggagt tgcgtgtacg tcacggcgcg ttgggaatcc gtgtaggcga 4 6 500 

tccggtcgcc gcgtcgctcg gcatggatgc ggagcaattc gtgcaacggc cggattggtt 46560 

ccacacgcgc catggaaaca cctttctctc gaccaaccgc acaacagcac ggaaccggcc 46620 

acgagtagac gccggcgacg ctagcagcgt tttccggacc gccaccccct gaagatcccc 46680 

ctaccgtggc cggcctcccc ggacgctcat ctagggggtt gcacgcatac cgccgtgcgt 46740 

aattgccttc ctgatgaccg atgccggacg ccagggaagg gtggaggcgt tgtccatatc 46800 

tgtcacggcg ccgtattgcc gcttcgagaa gaccggatca ccggacctcg agggtgacga 4 6860 

gacggtgctc ggcctgatcg agcacggcac cggccacacc gacgtgtcgc tggtggacgg 4 6 920 

tgctccccgg accgccgtgc acaccacgac ccgtgacgac gaggcgttca ccgaggtctg 4 698 0 

gcacgcacag cgccctgtcg agtccggcat ggacaacggc atcgcctggg cccgcaccga 47040 

cgcgtacctg ttcggtgtcg tgcgcaccgg cgagagcggc aggtacgccg atgccaccgc 47100 

ggccctctac acgaacgtct tccagctcac ccggtcgctg gggtatcccc tgctcgcccg 47160 



46020 
46080 
46140 
46200 
46260 
46320 
46380 
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gacctggaac tacgtcagcg gtatcaa'cac gacgaacgcg gacgggctgg aggtgtaccg 47220 
ggacttctgc gtgggccgcg cccaggcgct cgacgagggc gggatcgacc cggccaccat 4 7280 
gcccgcggcc accggtatcg gcgcccacgg gggcggcatc acctgcgtgt tcctcgccgc 47340 
ccggggcgga gtgcggatca acatcgagaa ccccgccgtc ctcacggccc accactaccc 4 7400 
gacgacgtac ggtccgcggc ccccggtctt cgcacgggcc acctggctgg gcccgccgga 4 7460 
ggggggccgg ctgttcatct ccgcgacggc cggcatcctc ggacaccgaa cggtgcacca 47520 
cggtgatgtg accggccagt gcgaggtcgc cctcgacaac atggcccggg tcatcggcgc 4 7580 
ggagaacctg cggcgccacg gcgtccagcg ggggcacgtc ctcgccgacg tggaccacct 47640 
caaggtctac gtccgccgcc gcgaggatct cgatacggtc cgccgggtct gcgccgcacg 47700 
cctgtcgagc accgcggccg tcgccctttt gcacaccgac atagcccgcg aggatctgct 47760 
cgtcgaaatc gaaggcatgg tggcgtgaca atacccggta aaaggcccgc gacgctgcgc 47820 
ctcggcggat ccgcgaagag aaagaagagc gtcaccgcac agcgcggcag cccggtcctt 47880 
tcgtccttcg cacagcggcg gatctggttt ctccagcaat tggacccgga gagcaacgcc 47940 
tataatctcc cgctcgtgca acgcctgcgc ggtctattgg acgcgccggc cctggagcgt 48000 

gcgctggcgc tcgtcgtcgc gcgccacgag gcgttgcgga cggtgttcga caccgccgac 48060 

ggcgagcccc tccagcgggt gcttcccgcc ccggaacacc tcctgcgcca cgcgcgggcg 48120 

ggcagcgagg aggacgccgc ccggctcgtc cgcgacgaga tcgccgcgcc gttcgacctc 48180 

gccaccgggc cgttgatcag ggccctgctg atccgcctcg gtgacgacga ccacgttctc 48240 

gcggtgaccg tgcaccatgt cgccggcgac ggctggtcgt tcgggctcct ccaacatgaa 48300 

ctcgcagccc actacacggc gctgcgcgac actgcccgcc ctgccgaact gccgccgttg 48360 

Q ccggtgcagt acgccgactt cgccgcctgg gagcggcgcg aactcaccgg cgccggactg 48420 

^ gacaggcgtc tggcctactg gcgcgagcaa ctccggggcg ccccggcgcg gctcgccctc 48480 

cccaccgacc gtccccgccc gccggtcgcc gacgcggacg cgggcatggc cgagtggcgg 48540 

ccgccggccg cgctggccac cgcggtcctc acgctcgcgc gcgactccgg tgcgtccgtg 48600 

ttcatgaccc tgctggcggc cttccaagcg gtcctcgccc ggcaggcggg cacgcgggac 48660 

gtgctggtcg gcacgcccgt ggcgaaccgt acgcgggcgg cgtacgaggg cctgatcggc 48720 

atgttcgtca acacgctcgc gctgcgcggc gacctctcgg gcgatccgtc gttccgggaa 48780 

ctcctcgacc gctgccgggc cacgaccacg gacgcgttcg cccacgccga cctgccgttc 48840 

gagaacgtca tcgaactcgt cgcaccggaa cgcgacctgt cggtcaaccc ggtcgtccag 48900 

gtgctgttgc aggtgctgcg gcgcgacgcg gcgacggccg cgctgcccgg catcgcggcc 48960- 

gaaccgttcc gcaccggacg ctggttcacc cgcttcgacc tcgaattcca tgtgtacgag 49020 

p gagccgggtg gcgcgctgac cggcgaactg ctctacagcc gtgcgctgtt cgacgagcca 4 9080 

pj cggatcacgg ggttgctgga ggagttcacg gcggtgcttc aggcggtcac cgccgacccg 49140 

\| gacgtacggc tgtcgcggct gccggccggc gacgcgacgg cggcagcgcc cgtggtgccc 49200 

tcgaacgaca cggcgcggga cctgcccgtc gacacgctgc cgggcctgct ggcccggtac 49260 

2 gccgcacgca cccccggcgc cgtggccgtc accgacccgc acatctccct cacctacgcg 4 932 0 

' cagctggacc ggcgggcgaa ccgcctcgcg cacctgctcc gcgcgcgcgg caccgccacc 49380 

ggcgacctgg tcgggatctg cgccgatcgc ggcgccgacc tgatcgtcgg catcgtgggg 4 944 0 

atcctcaagg cgggcgccgc ttatgtgccg ctggaccccg aacatcctcc ggagcgcacg 4 9500 

gcgttcgtgc tggccgacgc gcagctgacc acggtggtgg cgcacgaggt ctaccgttcc 4 9560 

cggttccccg atgtgccgca cgtggtggcg ttggacgacc cggagctgga ccggcagccg 4 9620 

gacgacacgg cgccggacgt cgagctggac cgggacagcc tcgcctacgc gatctacacg 49680 

tccgggtcga ccggcaggcc gaaggccgtg ctcatgccgg gtgtcagcgc cgtcaacctg 49740 

ctgctctggc aggagcgcac gatgggccgc gagccggcca gccgcaccgt ccagttcgtg 49800 

acgcccacgt tcgactactc ggtgcaggag atcttttccg cgctgctggg cggcacgctc 49860 

gtcatcccgc cggacgaggt gcggttcgac ccgccgggac tcgcccggtg gatggacgaa 4 992 0 

caggcgatta cccggatcta cgcgccgacg gccgtactgc gcgcgctgat cgagcacgtc 49980 

gatccgcaca gcgaccagct cgccgccctg cggcacctgt gccagggcgg cgaggcgctg 5004 0 

atcctcgacg cgcggttgcg cgagctgtgc cggcaccggc cccacctgcg cgtgcacaat 50100 

cactacggtc cggccgaaag ccagctcatc accgggtaca cgctgcccgc cgaccccgac 50160 

gcgtggcccg ccaccgcacc gatcggcccg ccgatcgaca acacccgcat ccatctgctc 50220 

gacgaggcga tgcggccggt tccggacggt atgccggggc agctctgcgt cgccggcgtc 50280 

ggcctcgccc gtgggtacct ggcccgtccc gagctgaccg ccgagcgctg ggtgccggga 5034 0 

gatgcggtcg gcgaggagcg catgtacctc accggcgacc tggcccgccg cgcgcccgac 50400 

ggcgacctgg aattcctcgg ccggatcgac gaccaggtca agatccgcgg catccgcgtc 50460 

gaaccgggtg agatcgagag cctgctcgcc gaggacgccc gcgtcacgca ggcggcggtg 50520 

tccgtgcgcg aggaccggcg gggcgagaag ttcctggccg cgtacgtcgt accggtggcc 50580 

ggccggcacg gcgacgactt cgccgcgtcg ctgcgcgcgg gactggccgc ccggctgccc 50640 
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gccgcgctcg tgccctccgc cgtcgtcctg gtggagcgac tgccgaggac cacgagcggc 50700 

aaggtggacc ggcgcgcgct gcccgacccg gagccgggcc cggcgtcgac cggggcggtt 5 0760 

acgccccgca ccgatgccga gcggacggtg tgccggatct tccaggaggt gctcgacgtc 50820 

ccgcgggtcg gtgccgacga cgacttcttc acgctcggcg ggcactccct gctcgccacc 50880 

cgggtcgtct cccgcatccg cgccgagctg ggtgccgatg tcccgctgcg tacgctcttc 50940 

gacgggcgga cgcccgccgc gctcgcccgt gcggcggacg aggccggccc ggccgccctg 51000 

cccccgatcg cgccctccgc ggagaacggg ccggcccccc tcaccgcggc acaggaacag 51060 

atgctgcact cgcacggctc gctgctcgcc gcgccctcct acacggtcgc cccgtacggg 51120 

ttccggctgc gcgggccact cgaccgcgaa gcgctcgacg cggcactgac ccggatcgcc 51180 

gcgcgccacg agccgctgcg gaccgggttc cgcgatcggg aacaggtcgt ccggccgccc 5124 0 

gctccggtgc gcgccgaggt ggttccggtg ccggtcggcg acgtcgacgc cgcggtccgg 51300 

gtcgcccacc gggagctgac ccggccgttc gacctcgtga acgggtcgtt gctgcgtgcc 51360 

gtgctgctgc cgctgggcgc cgaggatcac gtgctgctgc tgatgctgca ccacctcgcc 51420 

ggtgacggat ggtccttcga cctcctggtc cgggagttgt cggggacgca accggacctt 51480 

ccggtgtcct acacggacgt ggcccggtgg gaacggagtc cggccgtgat cgcggccagg 51540 

gagaacgacc gggcctactg gcgccggcgg ctggggggcg ccaccgcgcc ggagctgccc 51600 

gcggtccggc ccggcggggc accgaccggg cgggcgttcc tgtggacgct caaggacacc 51660 

gccgtcctgg cggcacgccg ggtcgcggac gcccacgacg cgacgttgca cgaaaccgtg 51720 

ctcggcgcct tcgccctggt cgtggcggag accgccgaca ccgacgacgt gctcgtcgcg 51780 

acgccgttcg cggaccgggg gtacgccggg accgaccacc tcatcggctt cttcgcgaag 51840 

gtcctcgcgc tgcgcctcga cctcggcggc acgccgtcgt tccccgaggt gctgcgccgg 51900 

gtgcacaccg cgatggtggg cgcgcacgcc caccaggcgg tgccctactc cgcgctgcgc 51960 

gccgaggacc ccgcgctgcc gccggccccc gtgtcgttcc agctcatcag cgcgctcagc 52020 

gcggaactgc ggctgcccgg catgcacacc gagccgttcc ccgtcgtcgc cgagaccgtc 52080 

gacgagatga ccggcgaact gtcgatcaac ctcttcgacg acggtcgcac cgtctccggc 52140 

gcggtggtcc acgatgccgc gctgctcgac cgtgccaccg tcgacgattt gctcacccgg 52200 

gtggaggcga cgctgcgtgc cgccgcgggc gacctcaccg tacgcgtcac cggttacgtg 52260 

^ gaaagcgagt agccatgccc gagcaggaca agacagtcga gtaccttcgc tgggcgaccg 52320 

01 cggaactcca gaagacccgt gcggaactcg ccgcgcacag cgagccgttg gcgatcgtgg . 523 80 

^ ggatggcctg ccggctgccc ggcggggtcg cgtcgccgga ggacctgtgg cagttgctgg 52440 

g agtccggtgg cgacggcatc accgcgttcc ccacggaccg gggctgggag accaccgccg 52500 

^ acggtcgcgg cggcttcctc accggggcgg ccggcttcga cgcggcgttc ttcggcatca 52560 

fy gcccgcgcga ggcgctggcg atggacccgc agcagcgcct ggccctggag acctcgtggg 52620 

aggcgttcga gcacgcgggc atcgatccgc agacgctgcg gggcagtgac acgggggtgt 52680 

^ tcctcggcgc gttcttccag gggtacggca tcggcgccga cttcgacggt tacggcacca 5274 0 

r: cgagcattca cacgagcgtg ctctccggcc gcctcgcgta cttctacggt ctggagggtc 52800 

cggcggtcac ggtcgacacg gcgtgttcgt cgtcgctggt ggcgctgcac caggccgggc 52 860 

agtcgctgcg ctccggcgaa tgctcgctcg ccctggtcgg cggcgtcacg gtgatggcct 52920 

cgccggcggg gttcgcggac ttctccgagc agggcggcct ggcccccgac gcgcgctgca 52 98 0 

aggccttcgc ggaagcggct gacggcaccg gtttcgccga ggggtccggc gtcctgatcg 53 04 0 

tcgagaagct ctccgacgcc gagcgcaacg gccaccgcgt gctggcggtc gtccggggtt 53100 

ccgccgtcaa ccaggacggt gcctccaacg ggctgtccgc gccgaacggg ccgtcgcagg 53160 

agcgggtgat ccggcaggcc ctggccaacg ccggactcac cccggcggac gtggacgccg 53220 

tcgaggccca cggcaccggc accaggctgg gcgaccccat cgaggcacag gccgtgctgg 532 8 0 

ccacctacgg gcaggggcgc gacacccctg tgctgctggg ctcgctgaag tccaacatcg 53340 

gccacaccca ggccgccgcg ggcgtcgccg gtgtcatcaa gatggtcctc gccatgcggc 53400 

acggcaccct gccccgcacc ctgcacgtgg acacgccgtc ctcgcacgtc gactggacgg 53460 

ccggcgccgt cgaactcctc accgacgccc ggccctggcc cgaaaccgac cgcccacggc 53 52 0 

gcgccggtgt ctcctccttc ggcgtcagcg gcaccaacgc ccacatcatc ctcgaaagcc 53580 

acccccgacc ggcccccgaa cccgccccgg cacccgacac cggaccgctg ccgctgctgc 53640 

tctcggcccg caccccgcag gcactcgacg cacaggtaca ccgcctgcgc gcgttcctcg 53700 

acgacaaccc cggcgcggac cgggtcgccg tcgcgcagac actcgcccgg cgcacccagt 53760 

tcgagcaccg cgccgtgctg ctcggcgaca cgctcatcac cgtgagcccg aacgccggcc 5382 0 

gcggaccggt ggtcttcgtc tactcggggc aaagcacgct gcacccgcac accgggcggc 53880 

aactcgcgtc cacctacccc gtgttcgccg aagcgtggcg cgaggccctc gaccacctcg 53940 

accccaccca gggcccggcc acgcacttcg cccaccagac cgcgctcacc gcgctcctgc 54000 

ggtcctgggg catcaccccg cacgcggtca tcggccactc cctcggtgag atcaccgccg 54 060 

cgcacgccgc cggtgtcctg tccctgaggg acgcgggcgc gctcctcacc acccgcaccc 54120 
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gcctgatgga ccaactgccg tcgggcggcg cgatggtcac cgtcctgacc agcgaggaaa 54180 
aggcacgcca ggtgctgcgg ccgggcgtgg agatcgccgc cgtcaacggc ccccactccc 54240 
tcgtgctgtc cggggacgag gaagccgtac tcgaagccgc ccggcagctc ggcatccacc 54300 
accgcctgcc gacccgccac gccggccact ccgagcgcat gcagccactc gtcgcccccc 54360 
tcctcgacgt cgcccggacc ctgacgtacc accagcccca caccgccatc cccggcgacc 5442 0 
ccaccaccgc cgaatactgg gcgcaccagg tccgcgacca agtacgtttc caggcgcaca 54480 
ccgagcagta cccgggcgcg acgttcctcg agatcggccc caaccaggac ctctcgccgc 54540 
tcgtcgacgg cgttgccgcc cagaccggta cgcccgacga ggtgcgggcg ctgcacaccg 54600 
cgctcgcgca gctccacgtc cgcggcgtcg cgatcgactg gacgctcgtc ctcggcgggg 54660 
accgcgcgcc cgtcacgctg cccacgtatc cgttccagca caaggactac tggctgcggc 5472 0 
ccacctcccg ggccgatgtg accggcgcgg ggcaggagca ggtggcgcac ccgctgctcg 54780 
gcgccgcggt cgcgctgccc ggcacgggcg gagtcgtcct gaccggccgc ctgtcgctgg 5484 0 

cctcccatcc gtggctcggc gagcacgcgg tcgacggcac cgtgctcctg cccggcgcgg 54 900 

ccttcctcga actcgcggcg cgcgccggcg acgaggtcgg ctgcgacctg ctgcacgaac 54 960 

tcgtcatcga gacgccgctc gtgctgcccg cgaccggcgg tgtggcggtc tccgtcgaga 55020 

tcgccgaacc cgacgacacg gggcggcggg cggtcaccgt ccacgcgcgg gccgacggct 55080 

cgggcctgtg gacccgacac gccggcggat tcctcggcac ggcaccggca ccggccacgg 5514 0 

ccacggaccc ggcaccctgg ccgcccgcgg aagccggacc ggtcgacgtc gccgacgtct 552 0 0 

acgaccggtt cgaggacatc gggtactcct acggaccggg cttccggggg ctgcgggccg 552 60 

cctggcgcgc cggcgacacc gtgtacgccg aggtcgcgct ccccgacgag cagagcgccg 553 2 0 

^ acgccgcccg tttcacgctg caccccgcgc tgctcgacgc cgcgttccag gccggcgcgc 55380 

Q tggccgcgct cgacgcaccc ggcggggcgg cccgactgcc gttctcgttc caggacgtcc 5544 0 

^ gcatccacgc ggccggggcg acgcggctgc gggtcacggt cggccgcgac ggcgagcgca 555 00 

gcaccgtccg catgaccggc ccggacgggc agctggtggc cgtggtcggt gccgtgctgt 55560 

^ cgcgcccgta cgcggaaggc tccggtgacg gcctgctgcg cccggtctgg accgagctgc 55620 

Q cgatgcccgt cccgtccgcg gacgatccgc gcgtggaggt cctcggcgcc gacccgggcg 55680 

Ul acggcgacgt tccggcggcc acccgggagc tgaccgcccg cgtcctcggc gcgctccagc 55 740 

^ gccacctgtc cgccgccgag gacaccacct tggtggtacg gaccggcacc ggcccggccg 55800 

gi . ctgccgccgc cgcgggtctg gtccgctcgg cgcaggcgga gaaccccggc cgcgtcgtgc 55860 

2 tcgtcgaggc gtccccggac acctcggtgg agctgctcgc cgcgtgcgcc gcgctggacg 5592 0 

Q aaccgcagct ggccgtccgg gacggcgtgc tcttcgcgcc gcggctggtc cggatgtccg 55980 

accccgcgca cggcccgctg tccctgccgg acggcgactg gctgctcacc cggtccgcct 56040 

ccggcacgtt gcacgacgtc gcgctcatag ccgacgacac gccccggcgg gcgctcgaag 56100 

ccggcgaggt ccgcatcgac gtccgcgcgg ccggactgaa cttccgcgat gtgctgatcg 56160 

cgctcgggac gtacaccggg gccacggcca tgggcggcga ggccgcgggc gtcgtggtgg 56220 

O agaccgggcc cggcgtggac gacctgtccc ccggcgaccg ggtgttcggc ctgacccggg 56280 

'r^^ gcggcatcgg cccgacggcc gtcaccgacc ggcgctggct ggcccggatc cccgacggct 56340 

ggagcttcac cacggcggcg tccgtcccga tcgtgttcgc gaccgcgtgg tacggcctgg 56400 

tcgacctcgg cacactgcgc gccggcgaga aggtcctcgt ccacgcggcc accggcggtg 5 6460 

tcggcatggc cgccgcacag atcgcccgcc acctgggcgc cgagctctac gccaccgcca 56520 

gtaccggcaa gcagcacgtc ctgcgcgccg ccgggctgcc cgacacgcac atcgccgact 56580 

ctcggacgac cgcgttccgg accgctttcc cgcgcatgga cgtcgtcctg aacgcgctga 5664 0 

ccggcgagtt catcgacgcg tcgctcgacc tgctggacgc cgacggccgg ttcgtcgaga 56700 

tgggccgcac cgagctgcgc gacccggccg cgatcgtccc cgcctacctg ccgttcgacc 56760 

tgctggacgc gggcgccgac cgcatcggcg agatcctggg cgaactgctc cggctgttcg 56820 

acgcgggcgc gctggagccg ctgccggtcc gtgcctggga cgtccggcag gcacgcgacg 56880 

cgctcggctg gatgagccgc gcccgccaca tcggcaagaa cgtcctgacg ctgccccggc 56940 

cgctcgaccc ggagggcgcc gtcgtcctca ccggcggctc cggcacgctc gccggcatcc 57000 

tcgcccgcca cctgcgcgaa cggcatgtct acctgctgtc ccggacggca ccgcccgagg 57060 

ggacgcccgg cgtccacctg ccctgcgacg tcggtgaccg ggaccagctg gcggcggccc 57120 

tggagcgggt ggaccggccg atcaccgccg tggtgcacct cgccggtgcg ctggacgacg 57180 

gcaccgtcgc gtcgctcacc cccgagcgtt tcgacacggt gctgcgcccg aaggccgacg 57240 

gcgcctggta cctgcacgag ctgacgaagg agcaggacct cgccgcgttc gtgctctact 573 00 

cgtcggccgc cggcgtgctc ggcaacgccg gccagggcaa ctacgtcgcc gcgaacgcgt 57360 

tcctcgacgc gctcgccgag ctgcgccacg gttccgggct gccggccctc tccatcgcct 57420 

gggggctctg ggaggacgtg agcgggctca ccgcggcgct cggcgaagcc gaccgggacc 57480 

ggatgcggcg cagcggtttc cgggccatca ccgcgcaaca gggcatgcac ctgtacgagg 57540 

cggccggccg caccggaagt cccgtggtgg tcgcggcggc gctcgacgac gcgccggacg 57600 
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. r^r^r^^cr^ acocaacctq cggcggacga ccgtccggcg ggccgccgtc cgggagtgtt 

Sb! Sss ^ 521 r,^:s 

S : ScSS? S^S- c--- ™ 151^2 

llctgccSg cggggtcgcg tcacccgagg agctgtggca ^ctcgtggca ^ccggcaccg 
S^gcLtcac ggagttccc. acggaccgc^ gctgggacgt cgacgcgatc -cgacccgg ^^^^^ 

SttcScIc Igcg??c??c ggcatcSgcc ^IcgcIIglc cctcgcgatg 9acccgcagc 58260 
Igcggltgct llt|gagacg tcgtgggagg cgttcgaaag cgccggcatc accccggact 5 320 
c|aclcg?gg cagcgacacc ggcgtgttcg tcggcgcctt f^J^^^^^^ tcSccgg? 58440 

is^ScS s . - .-00 

ii^iiim ssss =cg| 1 ii^sS 

gcggcctcgc gccggscggc ogggcgaagg cgttcggcgc 999tgcggac 99= g,^^ 
LIfcgagig tgccggtgtg ctg.tcgtcg ^9.g9ctotc =9.c9=cgaa =9=aa=ggtc 

?g?cglcr== SaSScg gS? S^9"cc; IS^ilcSg ^cca.cgg saas„ 

SS=a ISgfcS cl!r.gga| |ag|SJ jS 

Sl-^cc SI^S - 

•l^glcclg^ KfcSclgg cScgf^^l SIc^l^cK ^^^^ | g. 9cg J ....o 

claacgccca cgtcatcctg gaggccggac cggtaacgga gacgcccgcg S^^^cgcctt by 

b-SS -cSc^g -=acg!lS Sg g| s«00 

agcatcccgc gatgggcgag cagctcgccg =^9^^^^^^^ ccccacgcac Jgc?agcl?g 59640 
lllTclllt ccfccSgc^ SgScSI Srcc^rcg gtcc^SSc a^cacjccgc S..0O 
^o^ScIt coacca^LI Itiggcgaga tcaccgcggc gcacgccgcc ggcatcctgt 

be- gfer=r. g?gi:r.r.i s^^^rcS r4«9cg| 

Sll dbr= l9g"9lSg --ag ^ _ 

^gSlgSfa raSggfccc I^Sllgl" .jy^S .0.0 

ll^lll ltltll?.ll SSIc -gggg"l9 1^=^^==!= g.=9cg|.lg 

fal SSl^-g gSlIgccl? tl^l^^l cSgclSs 

°gS:rcS ttrcgt9?=l .c?StgcgI Iccgcgcggt 9ttcgtcgcc 9a9«99=gc 
cigccgclll ggaogcggtc gactgcgcc. =9gt=gagcg 9Ctcgacatc 9=«=c9t9c 
oligccggcc gggccatggc cggacg.ccg t««9a="9 9gt=g«9.g ==?g=gg 9 

SiS iSiS lis ii S5i SSgs = 

tcttcgccga ggccgaggtg gacggaccgg ^^^^"^^^^ ^ccggatgg cgcgalctgi 60960 
acgcggtctt ctccgcggtc jgcgacggaa 9ccgccagcc ^g-gatgg 

cggtgcacgc gtcggacgcc accgtactgc 9^^^^^^^^^ actcaccqcg gaggcggtga 61080 
ccatgggatt cgccgccttc gacggcgccg gcctgccggt actcaccgcg gagg yy y 
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cgctgcggga ggtggcgtca ccgtccggct ccgaggagtc ggacggcctg caccggttgg 61140 

agtggctcgc ggtcgccgag gcggtctacg acggtgacct gcccgaggga catgtcctga 61200 

tcaccgccgc ccaccccgac gaccccgagg acatacccac ccgcgcccac acccgcgcca 61260 

cccgcgtcct gaccgccctg caacaccacc tcaccaccac cgaccacacc ctcatcgtcc 61320 

acaccaccac cgaccccgcc ggcgccaccg tcaccggcct cacccgcacc gcccagaacg 61380 

aacaccccca ccgcatccgc ctcatcgaaa ccgaccaccc ccacaccccc ctccccctgg 61440 

cccaactcgc caccctcgac cacccccacc tccgcctcac ccaccacacc ctccaccacc 61500 

cccacctcac ccccctccac accaccaccc cacccaccac cacccccctc aaccccgaac 61560 

acgccatcat catcaccggc ggctccggca ccctcgccgg catcctcgcc cgccacctga 61620 

accaccccca cacctacctc ctctcccgca ccccaccccc cgacgccacc cccggcaccc 61680 

acctcccctg cgacgtcggc gacccccacc aactcgccac caccctcacc cacatccccc 61740 

aacccctcac cgccatcttc cacaccgccg ccaccctcga cgacggcatc ctccacgccc 61800 

tcacccccga ccgcctcacc accgtcctcc accccaaagc caacgccgcc tggcacctgc 61860 

accacctcac ccaaaaccaa cccctcaccc acttcgtcct ctactccagc gccgccgccg 61920 

tcctcggcag ccccggacaa ggaaactacg ccgccgecaa cgccttcctc gacgccctcg 61980 

ccacccaccg ccacaccctc ggccaacccg ccacctccat cgcctggggc atgtggcaca 62040 

ccaccagcac cctcaccgga caactcgacg acgccgaccg ggaccgcatc cgccgcggcg 6210 0 

gtttcctccc gatcacggac gacgagggca tgcgcctcta cgaggcggcc gtcggctccg 62160 

gcgaggactt cgtcatggcc gccgcgatgg acccggcaca gccgatgacc ggctccgtac 6222 0 

cgcccatcct gagcggcctg cgcaggagcg cgcggcgcgt cgcccgtgcc gggcagacgt 6228 0 

Q tcgcccagcg gctcgccgag ctgcccgacg ccgaccgcgg cgcggcgctg accaccctcg 6234 0 

^ tctcggacgc cacggccgcc gtgctcggcc acgccgacgc ctccgagatc gcgccgacca 624 00 

cgacgttcaa ggacctcggc atcgactcgc tcaccgcgat cgagctgcgc aaccggctcg 62460 

■^^ cggaggcgac cgggctgcgg ctgagtgcca cgctggtgtt cgaccacccg acacctcggg 62520 

^ tcctcgccgc caagctccgc accgatctgt tcggcacggc cgtgcccacg cccgcgcgga 62580 

cggcacggac ccaccacgac gagccactcg cgatcgtcgg catggcgtgc cgactgcccg 62640 

'f^ gcggggtcgc ctcgccggag gacctgtggc agctcgtggc gtccggcacc gacgcgatca 62700 

^ ccgagttccc caccgaccgc ggctgggaca tcgaccggct gttcgacccg gacccggacg 627 6 0 

0*1 cccccggcaa gacctacgtc cggcacggcg gcttcctcgc cgaggccgcc ggcttcgatg 6282 0 

s ccgcgttctt cggcatcagc ccgcgcgagg cacgggccat ggacccgcag cagcgcgtca 62880 

Q tcctcgaaac ctcctgggag gcgttcgaga acgcgggcat cgtgccggac acgctgcgcg 6294 0 

g gcagcgacac cggcgtgttc atgggcgcgt tctcccatgg gtacggcgcc ggcgtcgacc 630 00 

fQ tgggcgggtt cggcgccacc gccacgcaga acagcgtgct ctccggccgg ttgtcgtact 63060 

tcttcggcat ggagggcccg gccgtcaccg tcgacaccgc ctgctcgtcg tcgctggtcg 63120 

Q ccctgcacca ggcggcacag gcgctgcgga ctggagaatg ctcgctggcg ctcgccggcg 63180 

gtgtcacggt gatgcccacc ccgctgggct acgtcgagtt ctgccgccag cggggactcg 6324 0 

^ cccccgacgg ccgttgccag gccttcgcgg aaggcgccga cggcacgagc ttctcggagg 63300 

gcgccggcgt tcttgtgctg gagcggctct ccgacgccga gcgcaacgga cacaccgtcc 633 60 

tcgcggtcgt ccgctcctcc gccgtcaacc aggacggcgc ctccaacggc atctccgcac 63420 

ccaacggccc ctcccagcag cgcgtcatcc gccaggccct cgacaaggcc gggctcgccc 63480 

ccgccgacgt ggacgtggtg gaggcccacg gcaccggaac cccgctgggc gacccgatcg 6354 0 

aggcacaggc catcatcgcg acctacggcc aggaccgcga cacaccgctc tacctcggtt 63600 

cggtcaagtc gaacatcgga cacacccaga ccaccgccgg tgtcgccggc gtcatcaaga 63660 

tggtcatggc gatgcgccac ggcatcgcgc cgaagacact gcacgtggac gagccgtcgt 6372 0 

cgcatgtgga ctggaccgag ggtgcggtgg aactgctcac cgaggcgagg ccgtggcccg 63 780 

acgcgggacg cccgcgccgc gcgggcgtgt cgtcgctcgg tatcagcggt acgaacgccc 63 840 

acgtgatcct tgagggtgtt cccgggccgt cgcgtgtgga gccgtctgtt gacgggttgg 63900 

tgccgttgcc ggtgtcggct cggagtgagg cgagtctgcg ggggcaggtg gagcggctgg 63960 

aggggtatct gcgcgggagt gtggatgtgg ccgcggtcgc gcaggggttg gtgcgtgagc 64020 

gtgctgtctt cggtcaccgt gcggtactgc tgggtgatgc ccgggtgatg ggtgtggcgg 64080 

tggatcagcc gcgtacggtg ttcgtctttc ccgggcaggg tgctcagtgg gtgggcatgg 6414 0 

gtgtggagtt gatggaccgt tctgcggtgt tcgcggctcg tatggaggag tgtgcgcggg 642 00 

cgttgttgcc gcacacgggc tgggatgtgc gggagatgtt ggcgcggccg gatgtggcgg 64260 

agcgggtgga ggtggtccag ccggccagct gggcggtcgc ggtcagcctg gccgcactgt 64320 

ggcaggccca cggggtcgta cccgacgcgg tgatcggaca ctcccagggc gagatcgcgg 64 3 80 

cggcgtgcgt ggccggggcc ctcagccttg aggacgccgc ccgcgtggtg gccttgcgca 6444 0 

gccaggtcat cgcggcgcga ctggccgggc ggggagcgat ggcttcggtg gcattgccgg 64500 

ccggtgaggt cggtctggtc gagggcgtgt ggatcgcggc gcgtaacggc cccgcctcga 64560 
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cagtcgtggc cggcgagccg tcggcggtgg aggacgtggt gacgcggtat gagaccgaag 64620 
gcgtgcgagt gcgtcgtatc gccgtcgact acgcctccca cacgccccac gtggaagcca 64680 
tcgaggacga actcgctgag gtactgaagg gagttgcagg gaaggccgcg tcggtggcgt 
ggtggtcgac cgtggacagc gcctgggtga ccgagccggt ggatgagagt tactggtacc 
ggaacctgcg tcgccccgtc gcgctggacg cggcggtggc ggagctggac gggtccgtgt 
tcgtggagtg cagcgcccat ccggtgctgc tgccggcgat ggaacaggcc cacacggtgg 
cgtcgttgcg caccggtgac ggcggctggg agcgatggct gacggcgttg gcgcaggcgt 
ggaccctggg cgcggcagtg gactgggaca cggtggtcga accggtgcca gggcggctgc 
tcgatctgcc cacctacgcg ttcgagcgcc ggcgctactg gctggaagcg gccggtgcca 
ccgacctgtc cgcggccggg ctgacagggg cagcacatcc catgctggcc gccatcacgg 
cactacccgc cgacgacggt ggtgttgttc tcaccggccg gatctcgttg cgcacgcatc 
cctggctggc tgatcacgcg gtgcggggca cggtcctgct gccgggcacg gcctttgtgg 65280 
agctggtcat ccgggccggt gacgagaccg gttgcgggat agtggatgaa ctggtcatcg 65340 
aatcccccct cgtggtgccg gcgaccgcag ccgtggatct gtcggtgacc gtggaaggag 
ctgacgaggc cggacggcgg cgagtgaccg tccacgcccg caccgaaggc accggcagct 
ggacccggca cgccagcggc accctgaccc ccgacacccc cgacaccccc aacgcttccg 
gtgttgtcgg tgcggagccg ttctcgcagt ggccacctgc cactgccgcg gccgtcgaca 
cctcggagtt ctacttgcgc ctggacgcgc tgggctaccg gttcggaccc atgttccgcg 65640 
gaatgcgggc tgcctggcgt gatggtgaca ccgtgtacgc cgaggtcgcg ctccccgagg 65700 
accgtgccgc cgacgcggac ggtttcggca tgcacccggc gctgctcgac gcggccttgc ^Hf^^ 
O agagcggcag cctgctcatg ctggaatcgg acggcgagca gagcgtgcaa ctgccgttct 
cctggcacgg cgtccggttc cacgcgacgg gcgcgaccat gctgcgggtg gcggtcgtac 
ifl cgggcccgga cggcctccgg ctgcatgccg cggacagcgg gaaccgtccc gtcgcgacga 
tcgacgcgct cgtgacccgg tccccggaag cggacctcgc gcccgccgat ccgatgctgc 
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PI gggtcgggtg ggccccggtg cccgtacctg ccggggccgg tccgtccgac gcggacgtgc 66060 

^ tgacgctgcg cggcgacgac gccgacccgc tcggggagac ccgggacctg accacccgtg 6 612 0 

ttctcgacgc gctgctccgg gccgaccggc cggtgatctt ccaggtgacc ggtggcctcg 66180 

C ccgccaaggc ggccgcaggc ctggtccgca ccgctcagaa cgagcagccc ggccgcttct 66240 

^ tcctcgtcga aacggacccg ggagaggtcc tggacggcgc gaagcgcgac gcgatcgcgg 66300 

cactcggcga gccccatgtg cggctgcgcg acggcctctt cgaggcagcc cggctgatgc 66360 

gggccacgcc gtccctgacg ctcccggaca ccgggtcgtg gcagctgcgg ccgtccgcca 66420 

£Q ccggttccct cgacgacctt gccgtcgtcc ccaccgacgc cccggaccgg ccgctcgcgg 66480 

llj ccggcgaggt gcggatcgcg gtacgcgcgg cgggcctgaa cttccgggat gtcacggtcg 66540 

%l cgctcggtgt ggtcgccgat gcgcgtccgc tcggcagcga ggccgcgggt gtcgtcctgg 66600 
Q agaccggccc cggtgtgcac gacctggcgc ccggcgaccg ggtcctgggg atgctcgcgg 
gcgccttcgg accggtcgcg atcaccgacc ggcggctgct cggccggatg ccggacggct 
""^ ggacgttccc gcaggcggcg tccgtgatga ccgcgttcgc gaccgcgtgg tacggcctgg 
tcgacctggc cgggctgcgc cccggcgaga aggtcctgat ccacgcggcg gcgaccggtg 
tcggcgcggc ggccgtccag atcgcgcggc atctgggcgc ggaggtgtac gcgaccacca 
gcgccgcgaa gcgccatctg gtggacctgg acggagcgca tctggccgat tcccgcagca 
ccgcgttcgc cgacgcgttc ccgccggtcg atgtcgtgct caactcgctc accggtgaat 
tcctcgacgc gtccgtcggc ctgctcgcgg cgggtggccg gttcatcgag atggggaaga 

cggacatccg gcacgccgtc cagcagccgt tcgacctgat ggacgccggc cccgaccgga ^^^^^ 
tgcagcggat catcgtcgag ctgctcggcc tgttcgcgcg cgacgtgctg cacccgctgc 
cggtccacgc ctgggacgtg cggcaggcgc gggaggcgtt cggctggatg agcagcgggc 
gtcacaccgg caagctggtg ctgacggtcc cgcggccgct ggatcccgag ggggccgtcg 

tcatcaccgg cggctccggc accctcgccg gcatcctcgc ccgccacctg ggccaccccc 67380 

acacctacct gctctcccgc accccacccc ccgacaccac ccccggcacc cacctcccct 67440 

gcgacgtcgg cgacccccac caactcgcca ccaccctcgc ccgcatcccc caacccctca 67500 

ccgccgtctt ccacaccgcc ggaaccctcg acgacgccct gctcgacaac ctcacccccg 67560 

accgcgtcga caccgtcctc aaacccaagg ccgacgccgc ctggcacctg caccggctca 67620 

cccgcgacac cgacctcgcc gcgttcgtcg tctactccgc ggtcgccggc ctcatgggca 67680 

gcccggggca gggcaactac gtcgcggcga acgcgttcct cgacgcgctc gccgaacacc 6774 0 

gccgtgcgca agggctgccc gcgcagtccc tcgcatgggg catgtgggcg gacgtcagcg 67800 

cgctcaccgc gaaactcacc gacgcggacc gccagcgcat ccggcgcagc ggattcccgc 67860 

cgttgagcgc cgcggacggc atgcggctgt tcgacgcggc gacgcgtacc ccggaaccgg 67920 

tcgtcgtcgc gacgaccgtc gacctcaccc agctcgacgg cgccgtcgcg ccgttgctcc ^^^^^ 
gcggtctggc cgcgcaccgg gccgggccgg cgcgcacggt cgcccgcaac gccggcgaag 
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agcccctggc cgtgcgtctt gccgggcgta ccgccgccga gcagcggcgc atcatgcagg 68100 

aggtcgtgct ccgccacgcg gccgcggtcc tcgcgtacgg gctgggcgac cgcgtggcgg 68160 

cggaccgtcc gttccgcgag ctcggtttcg attcgctgac cgcggtcgac ctgcgcaatc 68220 

ggctcgcggc cgagacgggg ctgcggctgc cgacgacgct ggtgttcagc cacccgacgg 68280 

cggaggcgct caccgcccac ctgctcgacc tgatcgacgc tcccaccgcc cggatcgccg 68340 

gggagtccct gcccgcggtg acggccgctc ccgtggcggc cgcgcgggac caggacgagc 68400 

cgatcgccat cgtggcgatg gcgtgccggc tgcccggtgg tgtgacgtcg cccgaggacc 68460 

tgtggcggct cgtcgagtcc ggcaccgacg cgatcaccac gcctcctgac gaccgcggct 6852 0 

g99acgtcga cgcgctgtac gacgcggacc cggacgcggc cggcaaggcg tacaacctgc 68580 

ggggcggtta cctggccggg gcggcggagt tcgacgcggc gttcttcgac atcagtccgc 68640 

gcgaagcgct cggcatggac ccgcagcaac gcctgctgct cgaaacggcg tgggaggcga 68 700 

tcgagcgcgg ccggatcagt ccggcgtcgc tccgcggccg ggaggtcggc gtctatgtcg 68760 

gtgcggccgc gcagggctac gggctgggcg ccgaggacac cgagggccac gcgatcaccg 68820 

gtggttccac gagcctgctg tccggacggc tggcgtacgt gctcgggctg gagggcccgg 68880 

cggtcaccgt ggacacggcg tgctcgtcgt ctctggtcgc gctgcatctg gcgtgccagg 6 8 940 

ggctgcgcct gggcgagtgc gaactcgctc tggccggagg ggtctccgta ctgagttcgc 69000 

cggccgcgtt cgtggagttc tcccgccagc gcgggctcgc ggccgacggg cgctgcaagt 69060 

cgttcggcgc gggcgcggac ggcacgacgt ggtccgaggg cgtgggcgtg ctcgtactgg 69120 

aacggctctc cgacgccgag cggctcgggc acaccgtgct cgccgtcgtc cgcggcagcg 6 9180 

ccgtcacgtc cgacggcgcc tccaacggcc tcaccgcgcc gaacgggctc tcgcagcagc 6924 0 

gggtcatccg gaaggcgctc gccgcggccg ggctgaccgg cgccgacgtg gacgtcgtcg 6 9300 

^ aggggcacgg caccggcacc cggctcggcg acccggtcga ggcggacgcg ctgctcgcga 69360 

cgtacgggca ggaccgtccg gcaccggtct ggctgggctc gctgaagtcg aacatcggac 69420 

atgccacggc cgcggccggt gtcgcgggcg tcatcaagat ggtgcaggcg atcggcgcgg 6 9480 

gcacgatgcc gcggacgctg catgtggagg agccctcgcc cgccgtcgac tggagcaccg 69540 

gacaggtgtc cctgctcggc tccaaccggc cctggccgga cgacgagcgt ccgcgccggg 6 9600 

cggccgtctc cgcgttcggg ctcagcggga cgaacgcgca cgtcatcctg gaacagcacc 69660 

gtccggcgcc cgtggcgtcc cagccgcccc ggccgccccg tgaggagtcc cagccgctgc 69720 

cgtgggtgct ctccgcgcgg actccggccg cgctgcgggc ccaggcggcc cggctgcgcg 69780 

accacctcgc ggcggcaccg gacgcggatc cgttggacat cgggtacgcg ctggccacca 69840 

y gccgcgccca gttcgcccac cgtgccgcgg tcgtcgccac caccccggac ggattccgtg 69900 

^ ccgcgctcga cggcctcgcg gacggcgcgg aggcgcccgg agtcgtcacc gggaccgctc 69960 

aggagcggcg cgtcgccttc ctcttcgacg gccagggcgc ccagcgcgcc ggaatggggc 70020 

Sj gcgagctcca ccgccggttc cccgtcttcg ccgccgcgtg ggacgaggtc tccgacgcgt 70080 

Q tcggcaagca cctcaagcac tcccccacgg acgtctacca cggcgaacac ggcgctctcg 70140 

■y^ cccatga.cac cctgtacgcc caggccggcc tgttcacgct cgaagtggcg ctgctgcggc 70200 

tgctggagca ctggggggtg cggccggacg tgctcgtcgg gcactccgtc ggcgaggtga 70260 

ccgcggcgta cgcggcgggg gtgctcaccc tggcggacgc gacggagttg atcgtggccc 70320 

gggggcgggc gctgcgggcg ctgccgcccg gggcgatgct cgccgtcgac ggaagcccgg 70380 

cggaggtcgg cgcccgcacg gatctggaca tcgccgcggt caacggcccg tccgccgtgg 70440 

tgctcgccgg ttcgccggac gatgtggcgg cgttcgaacg ggagtggtcg gcggccgggc 70500 

ggcgcacgaa acggctcgac gtcgggcacg cgttccactc ccggcacgtc gacggtgcgc 70560 

tcgacggctt ccgtacggtg ctggagtcgc tcgcgttcgg cgcggcgcgg ctgccggtgg 70620 

tgtccacgac gacgggccgg gacgccgcgg acgacctcat aacgcccgcg cactggctgc 70680 

gccatgcgcg tcggccggtg ctgttctcgg atgccgtccg ggagctggcc gaccgcggcg 70740 

tcaccacgtt cgtggccgtc ggcccctccg gctccctggc gtcggccgcg gcggagagcg 70800 

ccggggagga cgccgggacc taccacgcgg tgctgcgcgc ccggaccggt gaggagaccg 70860 

cggcgctgac cgccctcgcc gagctgcacg cccacggcgt cccggtcgac ctggccgcgg 70920 

tactggccgg tggccggcca gtggaccttc ccgtgtacgc gttccagcac cgttcctact 70980 

ggctggcccc ggccgtggcg ggggcgccgg ccaccgtggc ggacaccggg ggtccggcgg 71040 

agtccgagcc ggaggacctc accgtcgccg agatcgtccg tcggcgcacc gcggcgctgc 71100 

tcggcgtcac ggaccccgcc gacgtcgatg cggaagcgac gttcttcgcg ctcggtttcg 71160 

actcactggc ggtgcagcgg ctgcgcaacc agctcgcctc ggcaaccggg ctggacctgc 71220 

cggcggccgt cctgttcgac cacgacaccc cggccgcgct caccgcgttc ctccaggacc 71280 

ggatcgaggc cggccaggac cggatcgagg ccggcgagga cgacgacgcg cccaccgtgc 71340 

tctcgctcct ggaggagatg gagtcgctcg acgccgcgga catcgcggcg acgccggccc 71400 

cggagcgtgc ggccatcgcc gatctgctcg acaagctcgc ccatacctgg aaggactacc 71460 

gatgagcacc gatacgcacg agggaacgcc gcccgccggc cgctgcccat tcgcgatcca 71520 
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ggacggtcac 

caagcactgg 

cagctcggcc 

ggactcaccg 

ggcgcgcaag 

ggccgcggga 

catcaacgcg 

cgacatcacc 

gcacgcgctg 

gctggcctcg 

cgcgacgctg 

cgcactgctc 

caacgcggtc 

ctgtgtcgag 

gctctactcg 

gacgcgcccg 

gcacatcgcc 

cgtccggctg 

gctgcgggtc 

gggacgacgg 

acccagcgct 

gtcggcgcga 

gtgcacgcct 

cacggcatcc 

atgaccttct 

acggagctgt 

atgctcgcgc 

gacgtcatcg 

agcgaacggc 

gtcgcggagg 

catggcttca 

gtcgccgcgc 

gccgcggtgc 

cccttcaccc 

acgaacagct 

cgccgctccg 

tccgcgtccg 

gcgaggtgcc 

cacgcttcgc 

agcagatcgg 

tgcacccgca 

atgagcctca 

acccgccaca 

tcccggaacg 

gcccagacca 

agccaccgct 

agcggcaatg 

ccgcattcga 

gcgtggaccg 

caggactgga 

gtggtccggt 

tgttcggacc 

acggctccgg 

tcggccgcgc 

ccctgctcgc 

cgcccgtcca 

tcccgcgacg 

cgctcgatgg 



cgcgccatcc 

ctggtcgccg 

gcgccgtccg 

gagcacaacc 

cgggaggact 

cccggcaccg 

ctgtacgggc 

ggctcggccg 

cggctggtcc 

gccgacgacg 

ctgttcgccg 

agccaccccg 

gaggagatgc 

gacgtcgatg 

acggccaacc 

ctggagggca 

cgggtgctca 

gccggcgacg 

acctgggggg 

tcgcgcacat 

gctacctgcg 

acatcggcat 

tcgagcccgc 

cgggccaggc 

atcccgacgc 

tgcgcacgct 

aactgcccga 

cggagcgcgg 

aggtcttcgc 

tccacgacat 

ccgtggtcgc 

ggcgggtggc 

ggacggcggc 

ccagcttgcg 

ggctggcgat 

cctcggtcag 

aggactcccc 

gtgcgcggcg 

ccatgtcggc 

cggcctcgtc 

gcgtcatcac 

gcccctcgtc 

gggccaggcc 

cgttgtacgc 

tgtgcagtcc 

ccgcccggtc 

cggcggccat 

cggcggcggt 

cctcgtcggc 

cggcatcggt 

ccgtcgtgac 

agccgcgcag 

aaaacgaggc 

cgggatagat 

tcggggcggc 

tcgccagcca 

cggtgagcag 

cggcggtgtc 



tggagagcgg 

ccgccgagga 

agatgctgcc 

gctaccggca 

tcgtcgccga 

acctcatccc 

tcacccctga 

atctggacag 

gcgcgaagcg 

gcgagatctc 

gccacgactc 

agcagcaggc 

tccgtttcct 

tgcggggcgt 

gcgaccccga 

acttcgcgtt 

tcaaggtcgc 

tgccgatgaa 

cggcatgagt 

caacgcgggc 

ccacggtgtc 

gttcacgctt 

gcccgtgccg 

ggaccagtgc 

cacgctgatg 

cggcctcaac 

cgtcagcgag 

tatcgaggcc 

cggcctcgag 

cgacggcgcg 

cgagcaggaa 

cggctgagcg 

tcagccggcg 

gaacacgttg 

ctccttgttg 

cgatgtgatc 

accgagccgc 

gaacagtccc 

gaggacgcgg 

gagcagttcg 

ccgcgcccgg 

acggccgcgg 

cggcacgtcg 

cgcccggtac 

gaagaggctg 

caggtcgccc 

cccccaggag 

caggtcgccg 

cggggtccgc 

gtcctcggcg 

ccgggagtgc 

cgcgttgctc 

gacctcgtcc 

cagcgcgagg 

ggagcgctgg 

gcaggcgagc 

ctcgggcaca 

gacgcgcagt 



cacggtgggt 

cgtcaagctg 

cgaccggcgg 

gaagatcgcg 

ggccgccgac 

cgggtacgcc 

ggagggggcc 

cgtcaagacg 

tgacgagcgg 

gctcagcgac 

ggtgcagcag 

ggcgctgcgc 

gcccgtcaac 

gcgcatccgt 

ggtgttcccg 

cggccacggc 

ctgcctgcgg 

cgaggggctc 

cacccggtgg 

gaggcgcagt 

gacctgcgcc 

ttcgcgcatc 

ttcgcggcgc 

gcggtctccg 

tccggtttcc 

ggcggctaca 

gagatcgaaa 

atcggcctgc 

gacaccgact 

ctcgaggagg 

ccgctgttcg 

ccgtcggggc 

tcggacagtt 

gtgaggtgct 

gtgcgcccga 

cgctgcgccg 

cggaggagcg 

cgcgcacggc 

gccagctcgt 

atccgcttgg 

gaccccatcg 

ccgagcagca 

acggaccagc 

cgcccggccg 

tcggaggtct 

agtcggatcg 

ggcacgaccc 

cggcgcagcg 

atgttgtcgt 

tagagcaggg 

tggagcacgt 

agggccttgt 

tcggccggcg 

gacaggtccg 

gccgccagga 

gacacggcgt 

tgccggccgg 

gcggcgtgga 



tcgttcgacc 

gtcaccaacg 

cccggctggt 

ggggacttca 

gcctgcctgg 

aagcggctgc 

gtgctggagg 

ctgaccgacg 

ggcgaggacc 

gacgaggcga 

atggtcggct 

gcgcgcccgg 

cagatgggcg 

gcgggcgaca 

cagcccgaca 

attcacaagt 

ttgttcgagc 

gggctgttca 

agacgttgcg 

tcctctaccg 

cgggggacgt 

tggagtgtcc 

tgcgggcgaa 

acagctccgg 

acgcggatgc 

ccgccgagga 

cccctgtggt 

tgaaggtcga 

ggccccgtat 

tcgtcacgct 

ccggcacggg 

cgcggccgtc 

ccttgggcag 

gttccaccgt 

ccgcggcgtg 

gcgtcacgtc 

gcacggctcc 

tgtgccgccg 

actggtcgcg 

ccggcggact 

gccgggacag 

gaagcgcttc 

gtcgcatccg 

cgagatggtg 

cctccggcaa 

cggcggccac 

ggggggcgag 

cggcctcggc 

caccggccag 

ccagcaacgc 

actcggcttt 

cggcgacggc 

gatcggccgg 

cgacgcgcag 

cctcggcggc 

gctcgctgga 

atctggcggg 

cggcggggtc 



tgttcggcgt 

atccgcggtt 

tctccgggat 

cactgcgcgc 

acgacatcga 

cctccctcgt 

cacggatgcg 

acttcttcgg 

tgctgcaccg 

cgggcgtgtt 

actgcctcta 

agctggtcga 

taccgcgcgt 

acgtgatccc 

ccttcgatgt 

gtcccggcca 

gtttcccgga 

gcccggccga 

gttgccgaac 

ggagatcttc 

ggtgttcgac 

tggtgtgacc 

cgtgacgcgg 

cacccggaag 

cgcggcccgg 

cgtcgacacc 

ccggctctcc 

cgtggagaag 

ccgccaggtc 

gctccgcggc 

catccaccag 

cgcaccggcg 

ttgctgacgg 

gctggaggtg 

cgacgccacc 

ctgggtgccg 

gcactgggtc 

gagcatgccg 

gcacatgatg 

gtaggccgcc 

ctgctcggag 

ggcggcgtcg 

ctccccgcag 

ttgcccacgg 

cggctcggcg 

ggtgctgctc 

cgcggcctcg 

gcggaacccc 

cttgtcgacc 

catcatggtc 

ggcctcggcc 

gcggtgccgg 

acgcggcgga 

gtgcgcccgg 

ctcgcccggc 

gaggagccgt 

atcgcagagc 

gtcggaggcc 



71580 

71640 

71700 

71760 

71820 

71880 

71940 

72000 

72060 

72120 

72180 

72240 

72300 

72360 

72420 

72480 

72540 

72600 

72660 

72720 

72780 

72840 

72900 

72960 

73020 

73080 

73140 

73200 

73260 

73320 

73380 

73440 

73500 

73560 

73620 

73680 

73740 

73800 

73860 

73920 

73980 

74040 

74100 

74160 

74220 

74280 

74340 

74400 

74460 

74520 

74580 

74640 

74700 

74760 

74820 

74880 

74940 

75000 
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cggtaggcga actccaggta ggtgacggcc tcgtcgagct cgccgcgcag gtggtgctcg 75060 

cgcgcggcgt cggtgaacag cccggcgacc tcggcgccgt gcacccggcc ggtacccatc 75120 

tggtggcggg cgagcacctt gctggccacg ccgcggtccc gcagcagttc cagcgccagc 75180 

tcgtgcaggc cacgccgctc ggcggcggag aggtcgtcga gtacgacgga gcgggccgcg 7524 0 

gggtgcggga accgcccttc ccgcagcagc cgcccctcga ccagctgttc gtgggcctgc 753 QO 

tcgaccgcct cggtgtcgag gccggtcatc cgctggacga gggtgagttc gacactctcg 75360 

ccgagcacgg cggaagctcg ggcgacgctc agcgcggccg ggccgcaacg atagagcgac 75420 

ccgaggtagg cgagccggta cgcccgcccc gcgaccactt ccaggcaccc tgaggtccgt 754 8 0 

gtccgtgcct cccggatgtc gtcgatcagg ccgtggccga ggagcaggtt gccgccggtc 7554 0 

gcccggaacg cctgggccac cacgtcgtcg tgcgcgtcct ggccgaggtg ccggcgcacg 75600 

agttcggtgg tctgcgcctc ggtgagcggg cgcagcgcga tctcctggta gtggcgcaga 75660 

ctcagcagtg ccgcccggaa ttgggagtgg gcgggcgtcg gccggagcag ctcggtcagc 75720 

acgatggcga cacgggcccg gctgatgcgg cgcgcgaggt ggagcaggca gcgcagcgac 75780 

ggcgcgtcgg cgtggtgcac gtcgtcgatg ccgatcagta cgggccgctc cgcggcgagc 75 84 0 

gtcagcaccg tgcgggtgag ttcggtcccc aggcggttgt cgacgtcggc cggcaggttt 75900 

tcgcacgatg ccgtcagccg gaccagctcc ggtgtccggg cggccagctc gggctggtcg 75960 

aggagctggc cgagcatgcc gtacggcagg gcccgctcct ccatggagca caccgcgcga 7602 0 

agggtgacga agccggcctt ggccgcggcg gcgtcgagga gttcggtctt gccgcaggcg 76080 

atcggcccgg tgacggcggc gacgacgccc cgcccgcccc ccgctcgggt gagcgcccgg 76140 

tggagggaac cgaactcgtc atcgcgggcg atcaggtctg ggggagataa gcgcgctatc 76200 

Q acgaatggaa ctacctcgcg accgtcgtgg aaacccatag gcatcacatg gcttgttgat 76260 

%S ctgtacggct gtgattcagc ctggcgggat gctgtgctac agatgggaag atgtgatcta 76320 

gggccgtgcc gttccctcag gagccgaccg cccccggcgc cacccgccgt accccctggg 76380 

^ ccaccagctc ggcgacccgc tcctggtggt cgacgaggta gaagtgcccg ccggggaaga 76440 

Q cctccaccgt ggtcggcgcg gtcgtgtgcc cggcccaggc gtgggcctgc tccaccgtcg 76500 

^ tcttcggatc gtcgtcaccg atgcacaccg tgatcggcgt ctccagcggc ggcgcgggct 76560 

}" cccaccggta cgtctccgcc gcgtagtagt ccgcccgcaa cggcgccagg atcagcgcgc 76620 

^ gcatttcgtc gtccgccatc acatcggcgc tcgtcccgcc gaggccgatg accgccgcca 76680 

0^ gcagctcgtc gtcggacgcg aggtggtcct ggtcggcgcg cggctgcgac . ggcgcccgcc 76740 

ggcccgagac gatcaggtgc gccaccggga gccgctgggc cagctcgaac gcgagtgtcg 76800 

Q cgcccatgct gtggccgaac agcaccagcg gacggtccag ccccggcttc aacgcctcgg 76860 

^ ccacgaggcc ggcgagaaca cgcaggtcgc gcaccgcctc ctcgtcgcgg cggtcctggc 76920 

fIJ ggccggggta ctgcacggcg tacacgtccg ccaccggggc gagcgcacgg gccagcggaa 76980 

%A ggtagaacgt cgccgatccg ccggcgtggg gcagcagcac cacccgtacc ggggcctcgg 77040 

Q gcgtggggaa gaactgccgc agccagagtt ccgagctcac cgcaccccct cggccgcgac 77100 

r: ctggggagcc cggaaccggg tgatctcggc caagtgcttc tcccgcatct ccgggtcggt 77160 

-"^"^ cacgccccat ccctcctccg gcgccagaca gaggacgccg actttgccgt tgtgcacatt 77220 

gcgatgcaca tcgcgcaccg ccgacccgac gtcgtcgagc gggtaggtca ccgacagcgt 77280 

cgggtgcacc atccccttgc agatcaggcg gttcgcctcc cacgcctcac gatagttcgc 77340 

gaagtgggta ccgatgatcc gcttcacgga catccacagg taccgattgt caaaggcgtg 77400 

ctcgtatccc gaggttgacg cgcaggtgac gatcgtgcca ccccgacgtg tcacgtagac 77460 

actcgcgccg aacgtcgcgc gccccgggtg ctcgaacacg atgtcgggat cgtcaccgcc 77520 

ggtcagctcc cggatc 77536 



<210> 2 
<211> 360 
<212> PRT 

<213> Streptomyces sp. 
<400> 2 

Met Thr lie Val Lys Cys Leu Val Trp Asp Leu Asp Asn Thr Leu Trp 
15 10 15 



Arg Gly Thr Val Leu Glu Asp Asp Glu Val Val Leu Thr Asp Glu lie 
20 25 30 
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Arg Glu Val lie Thr Thr Leu Asp Asp Arg Gly lie Leu Gin Ala Val 
35 40 45 

Ala Ser Lys Asn Asp His Asp Leu Ala Trp Glu Arg Leu Glu Arg Leu 
50 55 60 

Gly Val Ala Glu Tyr Phe Val Leu Ala Arg lie Gly Trp Gly Pro Lys 
65 70 75 . 80 

Ser Gin Ser Val Arg Glu lie Ala Thr Glu Leu Asn Phe Ala Pro Thr 
85 90 95 

Thr lie Ala Phe lie Asp Asp Gin Pro Ala Glu Arg Ala Glu Val Ala 
100 105 110 

Phe His Leu Pro Glu Val Arg Cys Tyr Pro Ala Glu Gin Ala Ala Thr 
115 120 125 

Leu Leu Ser Leu Pro Glu Phe Ser Pro Pro Val Ser Thr Val Asp Ser 
130 135 140 

Arg Arg Arg Arg Leu Met Tyr Gin Ala Gly Phe Ala Arg Asp Gin Ala 
145 150 155 160 

Arg Glu Ala Tyr Ser Gly Pro Asp Glu Asp Phe Leu Arg Ser Leu Asp 
165 170 175 

Leu Ser Met Thr lie Ala Pro Ala Gly Glu Glu Glu Leu Ser Arg Val 
180 185 190 

Glu Glu Leu Thr Leu Arg Thr Ser Gin Met Asn Ala Thr Gly Val His 
195 200 205 

Tyr Ser Asp Ala Asp Leu Arg Ala Leu Leu Thr Asp Pro Ala His Glu 
210 215 220 

Val Leu Val Val Thr Met Gly Asp Arg Phe Gly Pro His Gly Ala Val 
225 230 235 240 

Gly lie lie Leu Leu Glu Lys Lys Pro Ser Thr Trp His Leu Lys Leu 
245 250 255 

Leu Ala Thr Ser Cys Arg Val Val Ser Phe Gly Ala Gly Ala Thr lie 
260 265 270 

Leu Asn Trp Leu Thr Asp Gin Gly Ala Arg Ala Gly Ala His Leu Val 
275 280 285 

Ala Asp Phe Arg Arg Thr Asp Arg Asn Arg Met Met Glu lie Ala Tyr 
290 295 300 

Arg Phe Ala Gly Phe Ala Asp Ser Asp Cys Pro Cys Val Ser Glu Val 
305 310 315 320 

Ala Gly Ala Ser Ala Ala Gly Val Glu Arg Leu His Leu Glu Pro Ser 
325 330 335 
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Ala Arg Pro Ala Pro Thr Thr Leu Thr Leu Thr Ala Ala Asp lie Ala 
340 345 350 

Pro Val Thr Val Ser Ala Ala Gly 
355 360 



<210> 3 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Linker 
<400> 3 

ctagtgggca gatctggcag ct 



<210> 4 
Q <211> 14 
^ <212> DNA 

<213> Artificial Sequence 

fj <220> 

^ <223> Description of Artificial Sequence: Linker 

1^ 

<400> 4 

01 gccagatctg ccca 14 

Q 

^ <210> 5 

^1 <211> 12 

l^, <212> DNA 

^ <213> Artificial Sequence 

<223> Description of Artificial Sequence: Linker 
<400> 5 

gggatgcatg gc 12 



<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Linker 
<400> 6 

ttaagccatg catccccatg 



<210> 7 
<211> 24 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 7 

cgactcacta gtgggcagat ctgg 

<210> 8 
<211> 29 
<212> DNA 

<213> Artificial Sequence. 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 8 

cacgcctagg ccggtcggtc tcgggccac 

<210> 9 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

gcggctagct gctcgcccat cgcgggatgc 

<210> 10 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence : primer 
<400> 10 

gatgtacagc tcgagtcggc acgcccggcc gcatc 

<210> 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

cgactcactt aagccatgca tec 



<210> 12 
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<211> 31 
<2X2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 12 

atcctaggcg ggcrggygtg tcgtccttcg g 



<210> 13 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

atgctagccg ccgcgttccc cgtcttcgcg eg 



<210> 14 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

atgctagcgg attcgtcggt ggtgttcgcc ga 



<210> 15 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

atctcgagcc agtascgctg gtgytggaag g 



<210> 16 
<211> 4478 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 



28 




29 

ttc tec tac ggt tac ggc acc ggt gcg gac acc gac ggc ttc ggc gcg 719 
Phe Ser Tyr Gly Tyr Gly Thr Gly Ala Asp Thr Asp Gly Phe Gly Ala 
225 230 235 

acc ggc teg cag acc agt gtg etc tec ggc egg etg teg tac ttc tac 767 
Thr Gly Ser Gin Thr Ser Val Leu Ser Gly Arg Leu Ser Tyr Phe Tyr 
240 245 250 255 

ggt ctg gag ggt ccg gcg gtc aeg gtc gac aeg gcg tgt teg teg teg 815 
Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser 
260 265 270 

ctg gtg gcg ctg cac cag gee ggg cag teg ctg cgc tec ggc gaa tgc 863 
Leu Val Ala Leu His Gin Ala Gly Gin Ser Leu Arg Ser Gly Glu Cys 
275 280 285 

teg etc gee ctg gtc ggc ggc gtc aeg gtg atg gcg tet cec ggc ggc 911 

Ser Leu Ala Leu Val Gly Gly Val Thr Val Met Ala Ser Pro Gly Gly 

290 295 300 

o 

hB ttc gtg gag ttc tec egg cag cgc ggc etc gcg ccg gac ggc* egg gcg 959 

"-^ Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Ala 
305 310 315 



M aag gcg ttc ggc gcg ggt gcg gac ggc aeg age ttc gee gag ggt gee 1007 



hi 



Lys Ala Phe Gly Ala Gly Ala Asp Gly Thr Ser Phe Ala Glu Gly Ala 
320 325 330 335 



B ggt gtg ctg ate gtc gag agg etc tec gac gee gaa cgc aac ggt cac 1055 
Q Gly Val Leu lie Val Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His 
p 340 345 350 

V 9*^*^ ^^0^ 

^ Thr Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 

355 360 365 



o 



tec aac ggg ctg teg gcg ccg aac ggg ccg teg cag gag egg gtg ate 1151 
Ser Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gin Glu Arg Val lie 
370 375 380 

egg cag gee ctg gee aac gee ggg etc acc ccg gcg gac gtg gac gee 1199 
Arg Gin Ala Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp Val Asp Ala 
385 390 395 

gtc gag gee cac ggc ace ggc ace agg ctg ggc gac ecc ate gag gea 1247 
Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala 
400 405 410 415 

cag gcg gta etg gee acc tac gga cag gag cgc gee acc cec ctg ctg 1295 
Gin Ala Val Leu Ala Thr Tyr Gly Gin Glu Arg Ala Thr Pro Leu Leu 
420 425 430 

ctg ggc teg ctg aag tec aac ate ggc cac gee cag gee gcg tec ggc 1343 
Leu Gly Ser Leu Lys Ser Asn lie Gly His Ala Gin Ala Ala Ser Gly 
435 440 445 
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gtc gcc ggc ate ate aag atg gtg cag gcc etc egg eae ggg gag etg 13 91 
Val Ala Gly lie lie Lys Met Val Gin Ala Leu Arg His Gly Glu Leu 
450 455 460 

ccg ecg aeg etg eae gcc gac gag ecg teg ecg cac gtc gac tgg aeg 14 3 9 
Pro Pro Thr Leu His Ala Asp Glu Pro Ser Pro His Val Asp Trp Thr 
465 470 475 

gcc ggc gcc gtc gaa etg etg aeg teg gcc egg ccg tgg cec gag ace 14 87 
Ala Gly Ala Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr 
480 485 490 495 

gac egg cet agg egg gea ggc gtg teg tec tte ggg ate agt ggc ace 1535 
Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly lie Ser Gly Thr 
500 505 510 

aac gcc eae gtc ate etg gaa age gea ecc cec act cag cet geg gac 1583 
Asn Ala His Val lie Leu Glu Ser Ala Pro Pro Thr Gin Pro Ala Asp 
515 520 525 

aac geg gtg ate gag egg gea ccg gag tgg gtg ccg ttg gtg att teg 1631 
^ Asn Ala Val lie Glu Arg Ala Pro Glu Trp Val Pro Leu Val lie Ser 
^ 530 535 540 



yj 

ffl 



gee agg ace cag teg get ttg act gag cac gag ggc egg ttg cgt geg 1679 
Ala Arg Thr Gin Ser Ala Leu Thr Glu His Glu Gly Arg Leu Arg Ala 
545 550 555 



£^ tat etg geg geg teg ecc ggg gtg gat atg egg get gtg gea teg aeg 1727 
Q Tyr Leu Ala Ala Ser Pro Gly Val Asp Met Arg Ala Val Ala Ser Thr 
03 560 565 570 575 

ry 

\.| 9^9 atg aca egg teg gtg tte gag cac cgt gcc gtg etg etg gga 17 75 

Q Leu Ala Met Thr Arg Ser Val Phe Glu His Arg Ala Val Leu Leu Gly 

580 585 590 

gat gac ace gtc ace ggc ace get gtg tct gac cet egg geg gtg tte 1823 
Asp Asp Thr Val Thr Gly Thr Ala Val Ser Asp Pro Arg Ala Val Phe 
595 600 605 

gtc tte ecg gga cag ggg teg cag cgt get ggc atg ggt gag gaa etg 18 71 
Val Phe Pro Gly Gin Gly Ser Gin Arg Ala Gly Met Gly Glu Glu Leu 
610 615 620 

gee gee geg tte cec gtc tte geg egg ate eat cag cag gtg tgg gac 1919 
Ala Ala Ala Phe Pro Val Phe Ala Arg lie His Gin Gin Val Trp Asp 
625 * 630 635 

etg etc gat gtg ecc gat etg gag gtg aac gag ace ggt tae gee cag 1967 
Leu Leu Asp Val Pro Asp Leu Glu Val Asn Glu Thr Gly Tyr Ala Gin 
640 645 650 655 

ccg gee etg tte gea atg cag gtg get etg tte ggg etg etg gaa teg 2015 
Pro Ala Leu Phe Ala Met Gin Val Ala Leu Phe Gly Leu Leu Glu Ser 
660 665 670 
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gtg ctg gac ctt ccg aca tac gcc ttc cag cac cag cgc tac tgg etc 273 5 
Val Leu Asp Leu Pro Thr Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu 
900 905 910 

gag teg gea ege ccg gee gca tee gac gcg ggc cac cee gtg ctg ggc 2783 
Glu Ser Ala Arg Pro Ala Ala Ser Asp Ala Gly His Pro Val Leu Gly 
915 920 925 

tec ggt ate gee etc gee ggg teg ccg ggc egg gtg ttc acg ggt tec 2831 
Ser Gly lie Ala Leu Ala Gly Ser Pro Gly Arg Val Phe Thr Gly Ser 
930 935 940 

gtg ccg ace ggt gcg gac cgc gcg gtg ttc gtc gee gag ctg gcg ctg 2879 
Val Pro Thr Gly Ala Asp Arg Ala Val Phe Val Ala Glu Leu Ala Leu 
945 950 955 

gee gcc gcg gac gcg gtc gac tgc gee acg gtc gag egg etc gac ate 2 92 7 
Ala Ala Ala Asp Ala Val Asp Cys Ala Thr Val Glu Arg Leu Asp lie 
960 965 970 975 

gcc tec gtg cee ggc egg ccg ggc cat ggc egg aeg ace gta cag ace 2 975 
Ala Ser Val Pro Gly Arg Pro Gly His Gly Arg Thr Thr Val Gin Thr 
5 980 985 990 



W 

m 



m 



tgg gtc gac gag ccg gcg gac gac ggc egg cgc egg ttc ace gtg eae 3023 
Trp Val Asp Glu Pro Ala Asp Asp Gly Arg Arg Arg Phe Thr Val His 
995 1000 1005 

ace ege ace ggc gac gcc ccg tgg acg ctg cac gcc gag ggg gtg ctg 3071 
Thr Arg Thr Gly Asp Ala Pro Trp Thr Leu His Ala Glu Gly Val Leu 
1010 1015 1020 

cgc cee cat ggc acg gee ctg cee gat gcg gcc gac gcc gag tgg ccc 3119 
Arg Pro His Gly Thr Ala Leu Pro Asp Ala Ala Asp Ala Glu Trp Pro 
1025 1030 1035 

eca ccg ggc gcg gtg ccc gcg gac ggg ctg ccg ggt gtg tgg cgc egg 3167 
Pro Pro Gly Ala Val Pro Ala Asp Gly Leu Pro Gly Val Trp Arg Arg 
1040 1045 1050 1055 

ggg gac cag gtc ttc gee gag gee gag gtg gac gga ccg gac ggt ttc 3215 
Gly Asp Gin Val Phe Ala Glu Ala Glu Val Asp Gly Pro Asp Gly Phe 
1060 1065 1070 

gtg gtg cac ccc gac ctg etc gac gcg gtc ttc tec gcg gtc ggc gac 3263 
Val Val His Pro Asp Leu Leu Asp Ala Val Phe Ser Ala Val Gly Asp 
1075 1080 1085 

gga age cgc cag ccg gee gga tgg cgc gac ctg acg gtg cac gcg teg 3 311 
Gly Ser Arg Gin Pro Ala Gly Trp Arg Asp Leu Thr Val His Ala Ser 
1090 1095 1100 

gac gcc ace gta ctg ege gcc tgc etc ace egg ege ace gac gga, gee 3359 
Asp Ala Thr Val Leu Arg Ala Cys Leu Thr Arg Arg Thr Asp Gly Ala 
1105 1110 1115 
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atg gga ttc gcc gcc ttc gac ggc gcc ggc ctg ccg gta etc acc gcg 3407 
Met Gly Phe Ala Ala Phe Asp Gly Ala Gly Leu Pro Val Leu Thr Ala 
1120 1125 1130 1135 

gag gcg gtg acg ctg egg gag gtg gcg tea ccg tec ggc tec gag gag 3455 
Glu Ala Val Thr Leu Arg Glu Val Ala Ser Pro Ser Gly Ser Glu Glu 
1140 1145 1150 

teg gac ggc ctg cac egg ttg gag tgg etc gcg gtc gee gag gcg gte 3503 
Ser Asp Gly Leu His Arg Leu Glu Trp Leu Ala Val Ala Glu Ala Val 
1155 1160 1165 

tac gac ggt gac ctg ccc gag gga cat gtc ctg ate acc gcc gcc cac 3551 
Tyr Asp Gly Asp Leu Pro Glu Gly His Val Leu lie Thr Ala Ala His 
1170 1175 1180 

ccc gac gac ccc gag gac ata ccc acc cge gcc cac ace ege gee ace 3599 
Pro Asp Asp Pro Glu Asp lie Pro Thr Arg Ala His Thr Arg Ala Thr 
1185 1190 1195 

^ cge gte ctg ace gcc ctg caa cac cac etc acc acc ace gac cac acc 3 64 7 

Arg Val Leu Thr Ala Leu Gin His His Leu Thr Thr Thr Asp His Thr 
h 1200 1205 1210 1215 

Q 

etc ate gtc cac acc ace acc gac ccc gee ggc gcc ace gtc acc ggc 3695 

J^"^ Leu lie Val His Thr Thr Thr Asp Pro Ala Gly Ala Thr Val Thr Gly 
'^"^ 1220 1225 1230 

m . 

5 etc acc cge acc gee cag aac gaa cac ccc cac cge ate ege etc ate 3743 

Q Leu Thr Arg Thr Ala Gin Asn Glu His Pro His Arg lie Arg Leu lie 

m 1235 1240 1245 

gaa acc gac cac ccc cac acc ccc etc ccc ctg gcc caa etc gcc ace 3 791 
Glu Thr Asp His Pro His Thr Pro Leu Pro Leu Ala Gin Leu Ala Thr 
1250 1255 1260 



etc gac cac ccc cac etc cge etc acc cac cac ace etc cac cac ccc 3839 

Leu Asp His Pro His Leu Arg Leu Thr His His Thr Leu His His Pro 
1265 1270 1275 

cac etc acc ccc etc cac acc acc acc cea ccc acc acc acc ccc etc 3887 

His Leu Thr Pro Leu His Thr Thr Thr Pro Pro Thr Thr Thr Pro Leu 
1280 1285 1290 1295 

aac ccc gaa cac gee ate ate ate acc ggc ggc tec ggc acc etc gcc 3 935 

Asn Pro Glu His Ala lie lie lie Thr Gly Gly Ser Gly Thr Leu Ala 
1300 1305 1310 

ggc ate etc gcc cge cac ctg aac cac ccc cac acc tac etc etc tee 3 983 

Gly He Leu Ala Arg His Leu Asn His Pro His Thr Tyr Leu Leu Ser 
1315 1320 1325 



cge acc cea ccc ccc gac gcc acc ccc ggc ace cac etc ccc tgc gac 
Arg Thr Pro Pro Pro Asp Ala Thr Pro Gly Thr His Leu Pro Cys Asp 
1330 1335 1340 



4031 
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gtc ggc gac ccc cac caa etc gcc acc acc etc acc cac ate ccc caa 4079 
Val Gly Asp Pro His Gin Leu Ala Thr Thr Leu Thr His lie Pro Gin 
1345 1350 1355 

ccc etc acc gcc ate ttc cac ace gcc gcc ace etc gac gac ggc ate 412 7 
Pro Leu Thr Ala lie Phe His Thr Ala Ala Thr Leu Asp Asp Gly lie 
1360 1365 1370 1375 

etc cac gee etc ace ccc gac cge etc acc acc gtc etc cac ccc aaa 4175 
Leu His Ala Leu Thr Pro Asp Arg Leu Thr Thr Val Leu His Pro Lys 
1380 1385 1390 

gcc aac gcc gee tgg cac ctg cac cac etc ace caa aac caa ccc etc 4223 
Ala Asn Ala Ala Trp His Leu His His Leu Thr Gin Asn Gin Pro Leu 
1395 1400 1405 

ace cac ttc gtc etc tac tee age gcc gcc gee gtc etc ggc age ccc 4271 
Thr His Phe Val Leu Tyr Ser Ser .Ala Ala Ala Val Leu Gly Ser Pro 
1410 1415 1420 

=Q SS^ caa gga aac tac gcc gee gcc aac gee ttc etc gac gee etc gcc 4319 

Gly Gin Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala 
^ 1425 1430 1435 

W 

^ atg tgg cac ace acc age acc etc acc gga caa etc gac gac gcc gac 4415 

Q Met Trp His Thr Thr Ser Thr Leu Thr Gly Gin Leu Asp Asp Ala Asp 

M 1460 1465 1470 

m 

%j egg gac cge ate cge cgc ggc ggt ttc etc ccg ate acg gac gac gag 4463 

Q Arg Asp Arg lie Arg Arg Gly Gly Phe Leu Pro lie Thr Asp Asp Glu 

'n 1475 1480 1485 

ggc atggggatgc at 44 7 8 

Gly 

<210> 17 
<211> 1488 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 17 

lie Trp Gin Leu Ala Glu Ala Leu Leu Thr Leu Val Arg Glu Ser Thr 
15 10 15 



acc cac cge cac acc etc ggc caa ccc gee acc tec ate gee tgg ggc 4367 
Thr His Arg His Thr Leu Gly Gin Pro Ala Thr Ser lie Ala Trp Gly 
1440 1445 1450 1455 



Ala Ala Val Leu Gly His Val Gly Gly Glu Asp lie Pro Ala Thr Ala 
20 25 30 
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Ala Phe Lys Asp Leu Gly lie Asp Ser Leu Thr Ala Val Gin Leu Arg 
35 40 45 

Asn Ala Leu Thr Glu Ala Thr Gly Val Arg Leu Asn Ala Thr Ala Val 
50 55 60 

Phe Asp Phe Pro Thr Pro His Val Leu Ala Gly Lys Leu Gly Asp Glu 
65 70 75 80 

Leu Thr Gly Thr Arg Ala Pro Val Val Pro Arg Thr Ala Ala Thr Ala 
85 90 95 

Gly Ala His Asp Glu Pro Leu Ala lie Val Gly Met Ala Cys Arg Leu 
100 105 110 

Pro Gly Gly Val Ala Ser Pro Glu Glu Leu Trp His Leu Val Ala Ser 
115 120 125 

Gly Thr Asp Ala lie Thr Glu Phe Pro Thr Asp Arg Gly Trp Asp Val 
130 135 140 

Asp Ala lie Tyr Asp Pro Asp Pro Asp Ala lie Gly Lys Thr Phe Val 
145 150 155 160 

Arg His Gly Gly Phe Leu Thr Gly Ala Thr Gly Phe Asp Ala Ala Phe 
165 170 175 

Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
180 185 190 

Val Leu Leu Glu Thr Ser Tirp Glu Ala Phe Glu Ser Ala Gly lie Thr 
195 200 205 

Pro Asp Ser Thr Arg Gly Ser Asp Thr Gly Val Phe Val Gly Ala Phe 
210 215 220 

Ser Tyr Gly Tyr Gly Thr Gly Ala Asp Thr Asp Gly Phe Gly Ala Thr 
225 230 235 240 

Gly Ser Gin Thr Ser Val Leu Ser Gly Arg Leu Ser Tyr Phe Tyr Gly 
245 250 255 

Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu 
260 ^ 265 270 

Val Ala Leu His Gin Ala Gly Gin Ser Leu Arg Ser Gly Glu Cys Ser 
275 280 285 

Leu Ala Leu Val Gly Gly Val Thr Val Met Ala Ser Pro Gly Gly Phe 
290 295 300 

Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Ala Lys 
305 310 315 320 

Ala Phe Gly Ala Gly Ala Asp Gly Thr Ser Phe Ala Glu Gly Ala Gly 
325 330 335 
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Val Leu lie Val Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His Thr 
340 345 350 

val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser 
355 360 365 

Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gin Glu Arg Val He Arg 
370 375 380 



Gin Ala Leu Ala Asn Ala Gly Leu 



Thr Pro Ala Asp Val Asp Ala Val 



385 



390 395 



400 



Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu Ala Gin 
405 410 415 

Ala val Leu Ala Thr Tyr Gly Gin Glu Arg Ala Thr Pro Leu Leu Leu 
420 425 430 

Glv ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ser Gly Val 
435 440 445 

Ala Gly He He Lys Met Val Gin Ala Leu Arg His Gly Glu Leu Pro 
450 455 460 

pro Thr Leu His Ala Asp Glu Pro Ser Pro His Val Asp Trp Thr Ala 
465 470 475 480 

Glv Ala Val Glu Leu Leu Thr - Ser Ala Arg Pro Trp Pro Glu Thr Asp 
485 490 .495 

Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly He Ser Gly Thr Asn 
500 505 510 

Ala His val He Leu Glu Ser Ala Pro Pro Thr Gin Pro Ala Asp Asn 
515 520 525 

Ala Val He Glu Arg Ala Pro Glu Trp Val Pro Leu Val He Ser Ala 
530 535 540 

Ara Thr Gin Ser Ala Leu Thr Glu His Glu Gly Arg Leu Arg Ala Tyr 
545 550 555 560 

Leu Ala Ala Ser Pro Gly Val Asp Met Arg Ala Val Ala Ser Thr Leu 
565 570 575 

Ala Met Thr Arg Ser Val Phe Glu His Arg Ala Val Leu Leu Gly Asp 
580 585 590 

Asp Thr Val Thr Gly Thr Ala Val Ser Asp Pro Arg Ala Val Phe Val 
595 600 605 

Phe pro Gly Gin Gly Ser Gin Arg Ala Gly Met Gly Glu Glu Leu Ala 
610 615 620 

Ala Ala Phe Pro Val Phe Ala Arg He His Gin Gin Val Trp Asp Leu 
625 630 635 640 
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Leu Asp Val Pro Asp Leu Glu Val Asn Glu Thr Gly Tyr Ala Gin Pro 
645 650 655 

Ala Leu Phe Ala Met Gin Val Ala Leu Phe Gly Leu Leu Glu Ser Trp 
660 665 670 

Gly Val Arg Pro Asp Ala Val lie Gly His Ser Val Gly Glu Leu Ala 
675 680 685 

Ala Ala Tyr Val Ser Gly Val Trp Ser Leu Glu Asp Ala Cys Thr Leu 
690 695 700 

Val Ser Ala Arg Ala Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Val 
705 710 715 720 

Met Val Ala Val Pro Val Ser Glu Asp Glu Ala Arg Ala Val Leu Gly 
725 730 735 

Glu Gly Val Glu lie Ala Ala Val Asn Gly Pro Ser Ser Val Val Leu 
740 745 750 

Ser Gly Asp Glu Ala Ala Val Leu Gin Ala Ala Glu Gly Leu Gly Lys 
755 760 765 

Trp Thr Arg Leu Ala Thr Ser His Ala Phe His Ser Ala Arg Met Glu 
770. 775 780 

Pro Met Leu Glu Glu Phe Arg Ala Val Ala Glu Gly Leu Thr Tyr Arg 
785 790 795 800 

Thr Pro Gin Val Ser Met Ala Val Gly Asp Gin Val Thr Thr Ala Glu 
805 810 815 

Tyr Trp Val Arg Gin Val Arg Asp Thr Val Arg Phe Gly Glu Gin Val 
820 825 830 

Ala Ser Tyr Glu Asp Ala Val Phe Val Glu Leu Gly Ala Asp Arg Ser 
835 840 845 

Leu Ala Arg Leu Val Asp Gly Val Ala Met Leu His Gly Asp His Glu 
850 855 860 

lie Gin Ala Ala lie Gly Ala Leu Ala His Leu Tyr Val Asn Gly Val 
865 870 875 880 

Thr Val Asp Trp Pro Ala Leu Leu Gly Asp Ala Pro Ala Thr Arg Val 
885 890 895 

Leu Asp Leu Pro Thr Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu Glu 
900 905 910 

Ser Ala Arg Pro Ala Ala Ser Asp Ala Gly His Pro Val Leu Gly Ser 
915 920 925 

Gly lie Ala Leu Ala Gly Ser Pro Gly Arg Val Phe Thr Gly Ser Val 
930 935 940 
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Pro Thr Gly Ala Asp Arg Ala Val Phe Val Ala Glu Leu Ala Leu Ala 
945 950 955 960 

Ala Ala Asp Ala Val Asp Cys Ala Thr Val Glu Arg Leu Asp lie Ala 
965 970 975 

Ser Val Pro Gly Arg Pro Gly His Gly Arg Thr Thr Val Gin Thr Trp 
980 985 990 

Val Asp Glu Pro Ala Asp Asp Gly Arg Arg Arg Phe Thr Val His Thr 
995 1000 1005 

Arg Thr Gly Asp Ala Pro T2rp Thr Leu His Ala Glu Gly Val Leu Arg 
1010 1015 1020 

Pro His Gly Thr Ala Leu Pro Asp Ala Ala Asp Ala Glu Trp Pro Pro 
025 1030 1035 104 

Pro Gly Ala Val Pro Ala Asp Gly Leu Pro Gly Val Tirp Arg Arg Gly 
1045 1050 1055 

Asp Gin Val Phe Ala Glu Ala Glu Val Asp Gly Pro Asp Gly Phe Val 
1060 1065 1070 

Val His Pro Asp Leu Leu Asp Ala Val Phe Ser Ala Val Gly Asp Gly 
1075 1080 1085 

Ser Arg Gin Pro Ala Gly Trp Arg Asp Leu Thr Val His Ala Ser Asp 
1090 1095 1100 

Ala Thr Val Leu Arg Ala Cys Leu Thr Arg Arg Thr Asp Gly Ala Met 
105 1110 1115 112 

Gly Phe Ala Ala Phe Asp Gly Ala Gly Leu Pro Val Leu Thr Ala Glu 
1125 1130 1135 

Ala Val Thr Leu Arg Glu Val Ala Ser Pro Ser Gly Ser Glu Glu Ser 
1140 1145 1150 



Asp Gly Leu His Arg Leu Glu Trp Leu Ala Val Ala Glu Ala Val Tyr 
1155 1160 1165 

Asp Gly Asp Leu Pro Glu Gly His Val Leu lie Thr Ala Ala His Pro 
1170 1175 1180 

Asp Asp Pro Glu Asp lie Pro Thr Arg Ala His Thr Arg Ala Thr Arg 
185 1190 1195 120 

Val Leu Thr Ala Leu Gin His His Leu Thr Thr Thr Asp His Thr Leu 
1205 1210 1215 

lie Val His Thr Thr Thr Asp Pro Ala Gly Ala Thr Val Thr Gly Leu 
1220 1225 1230 



Thr Arg Thr Ala Gin Asn Glu His Pro His Arg lie Arg Leu lie Glu 
1235 1240 1245 
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Thr Asp His Pro His Thr Pro Leu Pro Leu Ala Gin Leu Ala Thr Leu 
1250 1255 1260 

Asp His Pro His Leu Arg Leu Thr His His Thr Leu His His Pro His 
265 1270 1275 128 

Leu Thr Pro Leu His Thr Thr Thr Pro Pro Thr Thr Thr Pro Leu Asn 
1285 1290 1295 

Pro Glu His Ala lie lie lie Thr Gly Gly Ser Gly Thr Leu Ala Gly 
1300 1305 1310 

lie Leu Ala Arg His Leu Asn His Pro His Thr Tyr Leu Leu Ser Arg 
1315 1320 1325 

Thr Pro Pro Pro Asp Ala Thr Pro Gly Thr His Leu Pro Cys Asp Val 
1330 1335 1340 

Gly Asp Pro His Gin Leu Ala Thr Thr Leu Thr His lie Pro Gin Pro 
345 1350 1355 136 

Leu Thr Ala lie Phe His Thr Ala Ala Thr Leu Asp Asp Gly lie Leu 
1365 1370 1375 

His Ala Leu Thr Pro Asp Arg Leu Thr Thr Val Leu His Pro Lys Ala 
1380 1385 1390 

Asn Ala Ala Trp His Leu His His Leu Thr Gin Asn Gin Pro Leu. Thr 
1395 1400 1405 

His Phe Val Leu Tyr Ser Ser Ala Ala Ala Val Leu Gly Ser Pro Gly 
1410 1415 1420 

Gin Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala Thr 
425 1430 1435 144 

His Arg His Thr Leu Gly Gin Pro Ala Thr Ser lie Ala Trp Gly Met 
1445 1450 1455 

Trp His Thr Thr Ser Thr Leu Thr Gly Gin Leu Asp Asp Ala Asp Arg 
1460 1465 1470 

Asp Arg lie Arg Arg Gly Gly Phe Leu Pro lie Thr Asp Asp Glu Gly 
1475 1480 1485 



<210> 18 
<211> 4571 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 



<220> 
<221> CDS 
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<222> (9) . . (4559) 



<400> 18 

agatctgg cag etc gcc gaa gcg ctg ctg acg etc gtc egg gag age ace 5 0 
Gin Leu Ala Glu Ala Leu Leu Thr Leu Val Arg Glu Ser Thr 
15 10 

gcc gcc gtg etc ggc cac gtg ggt ggc gag gac ate cee gcg acg gcg 98 
Ala Ala Val Leu Gly His Val Gly Gly Glu Asp lie Pro Ala Thr Ala 
15 20 25 30 

gcg ttc aag gac etc ggc ate gac teg etc ace gcg gtc cag ctg ege 146 
Ala Phe Lys Asp Leu Gly lie Asp Ser Leu Thr Ala Val Gin Leu Arg 
35 40 45 

aac gee etc ace gag gcg ace ggt gtg egg ctg aac gcc acg gcg gtc 194 
Asn Ala Leu Thr Glu Ala Thr Gly Val Arg Leu Asn Ala Thr Ala Val 
50 55 60 

^ ttc gac ttc ecg ace ccg cac gtg etc gee ggg aag etc ggc gac gaa 242 

□ Phe Asp Phe Pro Thr Pro His Val Leu Ala Gly Lys Leu Gly Asp Glu 

65 70 75 

\G 

^ ctg ace ggc ace ege gcg cee gtc gtg cee egg ace gcg gee acg gee 2 90 

p Leu Thr Gly Thr Arg Ala Pro Val Val Pro Arg Thr Ala Ala Thr Ala 

y 80 ,85 90 

ggt gcg cac gac gag ceg ctg gcg ate gtg gga atg gcc tge egg ctg 33 8 

Gly Ala His Asp Glu Pro Leu Ala lie Val Gly Met Ala Cys Arg Leu 
^ 95 100 105 110 

U ' 

ffl cee ggc ggg gtc gcg tea cee gag gag ctg tgg cac etc gtg gca tec 3 86 

fU Pro Gly Gly Val Ala Ser Pro Glu Glu Leu Trp His Leu Val Ala Ser 
%| 115 120 125 

Q 

1^ ggc ace gac gee ate acg gag ttc ceg acg gac ege ggc tgg gac gtc 4 34 

Gly Thr Asp Ala lie Thr Glu Phe Pro Thr Asp Arg Gly Trp Asp Val 
130 135 140 

gac gcg ate tac gac ceg gac cee gac gcg ate ggc aag ace ttc gtc 4 82 
Asp Ala lie Tyr Asp Pro Asp Pro Asp Ala lie Gly Lys Thr Phe Val 
145 150 155 

egg cac ggt ggc ttc etc ace ggc gcg aca ggc ttc gac gcg gcg ttc 530 
Arg His Gly Gly Phe Leu Thr Gly Ala Thr Gly Phe Asp Ala Ala Phe 
160. 165 170 

ttc ggc ate age ceg ege gag gcc etc gcg atg gac ceg cag cag egg 578 
Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
175 180 185 190 

gtg etc ctg gag acg teg tgg gag gcg ttc gaa age gee ggc ate ace 626 
Val Leu Leu Glu Thr Ser Trp Glu Ala Phe Glu Ser Ala Gly He Thr 
195 200 205 

ceg gac teg ace ege ggc age gac ace ggc gtg ttc gtc ggc gcc ttc 674 
Pro Asp Ser Thr Arg Gly Ser Asp Thr Gly Val Phe Val Gly Ala Phe 
210 215 220 
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tec tac ggt tac ggc acc ggt gcg gac acc gac ggc ttc ggc gcg acc 722 
Ser Tyr Gly Tyr Gly Thr Gly Ala Asp Thr Asp Gly Phe Gly Ala Thr 
225 230 235 

ggc teg cag acc agt gtg etc tec ggc egg etg teg tac ttc tac ggt 770 
Gly Ser Gin Thr Ser Val Leu Ser Gly Arg Leu Ser Tyr Phe Tyr Gly 
240 245 250 

etg gag ggt eeg gcg gte aeg gtc gac acg gcg tgt teg teg teg etg 818 
Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu 
255 260 265 270 

gtg gcg etg eac cag gee ggg cag teg etg cgc tec ggc gaa tgc teg 86 6 
Val Ala Leu His Gin Ala Gly Gin Ser Leu Arg Ser Gly Glu Cys Ser 
275 280 285 

etc gee etg gtc ggc ggc gtc acg gtg atg gcg tct cec ggc ggc ttc 914 
Leu Ala Leu Val Gly Gly Val Thr Val Met Ala Ser Pro Gly Gly Phe 
290 295 300 

iO 9tg gag ttc tec egg cag cgc ggc etc gcg eeg gac ggc egg gcg aag 962 
^£ Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Ala Lys 
^ 305 310 315 

gcg ttc ggc gcg ggt gcg gac ggc acg age ttc gee gag ggt gee ggt 1010 
■^f--^ Ala Phe Gly Ala Gly Ala Asp Gly Thr Ser Phe Ala Glu Gly Ala Gly 
^ 320 325 . 330 

0^ . ■ . . . ■ 

2 gtg etg ate gtc gag agg etc tec gac gee gaa cgc aac ggt eac ace 1058 

Q Val Leu lie Val Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His Thr 

arj 335 340 345 350 

ftj 

gtc etg gcg gtc gtc cgt ggt teg gcg gtc aac cag gat ggt gee tee 1106 

Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser 
355 360 365 

aac ggg etg teg gcg ccg aac ggg eeg teg cag gag egg gtg ate egg 1154 
Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gin Glu Arg Val lie Arg 
370 375 380 

cag gee etg gee aac gcc ggg etc ace ccg gcg gac gtg gac gee gtc 1202 
Gin Ala Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp Val Asp Ala Val 
385 390 395 

gag gcc eac ggc acc ggc acc agg etg ggc gac cec ate gag gea cag 12 50 
Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin 
400 405 410 

gcg gta etg gcc acc tac gga cag gag cgc gcc acc cec etg etg etg 1298 
Ala Val Leu Ala Thr Tyr Gly Gin Glu Arg Ala Thr Pro Leu Leu Leu 
415 420 425 430 

ggc teg etg aag tec aac ate ggc eac gee cag gcc gcg tec ggc gte 1346 
Gly Ser Leu Lys Ser Asn lie Gly His Ala Gin Ala Ala Ser Gly Val 
435 440 445 



hs3 



42 



gcc ggc ate ate aag atg gtg eag gee etc egg cac ggg gag ctg ecg 13 94 
Ala Gly lie lie Lys Met Val Gin Ala Leu Arg His Gly Glu Leu Pro 
450 455 460 

ecg acg ctg cac gee gac gag ecg teg ecg cac gte gae tgg aeg gee 1442 
Pro Thr Leu His Ala Asp Glu Pro Ser Pro His Val Asp Trp Thr Ala 
465 470 , 475 

ggc gcc gte gaa ctg ctg acg teg gcc egg ecg tgg eee gag ace gac 14 90 
Gly Ala Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr Asp 
480 485 490 

egg cet agg egg gcg ggc gtg teg tec ttc gga gte age ggc ace aac 1538 
Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser Gly Thr Asn 
495 500 505 510 

gcc cac gte ate ctg gag age gea eee ecc get cag ccc gcg gag gag 1586 

Ala His Val lie Leu Glu Ser Ala Pro Pro Ala Gin Pro Ala Glu Glu 
515 520 525 

Q 

^ gcg eag cet gtt gag acg ecg gtg gtg gcc teg gat gtg ctg ecg ctg 1634 

Ala Gin Pro Val Glu Thr Pro Val Val Ala Ser Asp Val Leu Pro Leu 
p 530 535 540 



w 
m 



gtg ata teg gee aag ace cag ccc gcc ctg ace gaa cac gaa gae egg 1682 
Val lie Ser Ala Lys Thr Gin Pro Ala Leu Thr Glu His Glu Asp Arg 
545 550 555 



^ ctg egc gcc tac ctg gcg gcg teg ccc ggg gcg gat ata egg get . gtg 173 0 

Q Leu Arg Ala Tyr Leu Ala Ala Ser Pro. Gly Ala Asp lie Arg Ala Val 
ffl 560 565 570 

Til 

%l gca teg acg ctg gcg gtg aca egg teg gtg ttc gag cac egc gcc gta 1778 

Ala Ser Thr Leu Ala Val Thr Arg Ser Val Phe Glu His Arg Ala Val 
575 580 585 590 

etc ett gga gat gae ace gte ace ggc ace gcg gtg acc gac ccc agg 1826 

Leu Leu Gly Asp Asp Thr Val Thr Gly Thr Ala Val Thr Asp Pro Arg 
595 600 605 

ate gtg ttt gte ttt ccc ggg cag ggg tgg cag tgg ctg ggg atg ggc 1874 

lie Val Phe Val Phe Pro Gly Gin Gly Trp Gin Trp Leu Gly Met Gly 

610 615 620 

agt gea ctg egc gat teg teg gtg gtg ttc gcc gag egg atg gcc gag 1922 

Ser Ala Leu Arg Asp Ser Ser Val Val Phe Ala Glu Arg Met Ala Glu 
625 630 635 

tgt gcg gcg gcg ttg egc gag ttc gtg gae tgg gat ctg ttc aeg gtt 1970 

Cys Ala Ala Ala Leu Arg Glu Phe Val Asp Trp Asp Leu Phe Thr Val 
640 645 650 

ctg gat gat ecg gcg gtg gtg gac egg gtt gat gtg gte cag ccc get 2018 

Leu Asp Asp Pro Ala Val Val Asp Arg Val Asp Val Val Gin Pro Ala 
655 660 665 670 
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tec tgg gcg atg atg gtt tec ctg gcc gcg gtg tgg cag gcg gcc ggt 2066 
Ser Trp Ala Met Met Val Ser Leu Ala Ala Val Trp Gin Ala Ala Gly 
675 680 685 

gtg egg ccg gat gcg gtg ate ggc eat teg cag ggt gag ate gee gea 2114 
Val Arg Pro Asp Ala Val lie Gly His Ser Gin Gly Glu lie Ala Ala 
690 695 700 

get tgt gtg geg ggt gcg gtg tea eta cgc gat gcc gee egg ate gtg 2162 
Ala Cys Val Ala Gly Ala Val Ser Leu Arg Asp Ala Ala Arg lie Val 
705 710 715 

ace ttg ege age cag geg ate gee egg ggc ctg geg ggc egg ggc geg 2210 
Thr Leu Arg Ser Gin Ala lie Ala Arg Gly Leu Ala Gly Arg Gly Ala 
720 725 730 

atg gea tee gte gcc ctg eee geg cag gat gtc gag ctg gte gac ggg 2258 
Met Ala Ser Val Ala Leu Pro Ala Gin Asp Val Glu Leu Val Asp Gly 
735 740 745 750 

ifl tgg ate gee gcc cae aac ggg eee gee tec ace gtg ate gcg ggc 2306 

Ala Trp lie Ala Ala His Asn Gly Pro Ala Ser Thr Val lie Ala Gly 
^ 755 760 765 

Q ace eeg gaa gcg gtc gac cat gtc etc ace get eat gag gea caa ggg 2 3 54 

Thr Pro Glu Ala Val Asp His Val Leu Thr Ala His Glu Ala Gin Gly 
C! 770 775 780 



ru 

'^4 



9tg egg gtg egg egg ate acc gtc gac tat gcc teg cae ace eeg eac 24 02 

Val Arg Val Arg Arg lie Thr Val Asp Tyr Ala Ser His Thr Pro His 
785 790 795 

gte gag ctg ate cgc gac gaa eta etc gac ate act age gac age age 2450 

Val Glu Leu lie Arg Asp Glu Leu Leu Asp He Thr Ser Asp Ser Ser 
800 805 810 

teg cag ace eeg etc gtg eeg tgg ctg teg ace gtg gac ggc acc tgg 2498 

Ser Gin Thr Pro Leu Val Pro Trp Leu Ser Thr Val Asp Gly Thr Trp 
815 820 825 830 

gtc gac age ccg ctg gac ggg gag tae tgg tac egg aac ctg cgt gaa 2546 

Val Asp Ser Pro Leu Asp Gly Glu Tyr Trp Tyr Arg Asn Leu Arg Glu 

835 840 845 

ccg gte ggt tte eac eee gee gte age cag ttg cag gcc cag ggc gac 2594 

Pro Val Gly Phe His Pro Ala Val Ser Gin Leu Gin Ala Gin Gly Asp 

850 855 860 

ace gtg tte gte gag gtc age gcc age eeg gtg ttg ttg cag gcg atg 2642 

Thr Val Phe Val Glu Val Ser Ala Ser Pro Val Leu Leu Gin Ala Met 
865 870 875 

gac gac gat gte gtc acg gtt gee acg ctg cgt cgt gac gac ggc gae 2690 

Asp Asp Asp Val Val Thr Val Ala Thr Leu Arg Arg Asp Asp Gly Asp 
880 885 890 
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gcc acc egg atg etc acc gee etg gea eag gee tat gtc cac ggc gtc 2738 
Ala Thr Arg Met Leu Thr Ala Leu Ala Gin Ala Tyr Val His Gly Val 
895 900 905 910 

aec gtc gac tgg ccc gee ate etc ggc acc acc aca ace egg gta etg 2786 
Thr Val Asp Trp Pro Ala lie Leu Gly Thr Thr Thr Thr Arg Val Leu 
915 920 925 

gae ctt eeg aec tac gcc tte caa eac eag egg tac tgg etc gag teg 2834 
Asp Leu Pro Thr Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu Glu Ser 
930 935 940 

gea cgc eeg gcc gca tec gae gcg ggc eac ccc gtg etg ggc tec ggt 2882 
Ala Arg Pro Ala Ala Ser Asp Ala Gly His Pro Val Leu Gly Ser Gly 
945 950 955 

ate gcc etc gee ggg teg eeg ggc egg gtg tte aeg ggt tec gtg eeg 293 0 
lie Ala Leu Ala Gly Ser Pro Gly Arg Val Phe Thr Gly Ser Val Pro 
960 965 970 

aec ggt gcg gac cgc gcg gtg tte gtc gcc gag etg gcg etg gcc gee 2978 
Thr Gly Ala Asp Arg Ala Val Phe Val Ala Glu Leu Ala Leu Ala Ala 
975 980 985 990 



4^ 



□ 



gcg gac gcg gtc gac tgc gee aeg gtc gag egg etc gae ate gcc tee 3 026 
Ala Asp Ala Val Asp Cys Ala Thr Val Glu Arg Leu Asp lie Ala Ser 
995 1000 1005 



gtg ccc ggc egg eeg ggc cat ggc egg aeg acc gta eag aec tgg gtc 3 074 
Val Pro Gly Arg Pro Gly His Gly Arg Thr Thr Val Gin Thr Trp Val 
1010 1015 1020 

fU 

%l gae gag eeg gcg gac gae ggc egg cgc egg tte aec gtg cac acc cgc 3122 
Asp Glu Pro Ala Asp Asp Gly Arg Arg Arg Phe Thr Val His Thr Arg 
1025 1030 1035 

ace ggc gae gee eeg tgg aeg etg eac gcc gag ggg gtg etg cgc ccc 3170 
Thr Gly Asp Ala Pro Trp Thr Leu His Ala Glu Gly Val Leu Arg Pro 
1040 1045 1050 

cat ggc aeg gcc etg ccc gat gcg gcc gac gcc gag tgg ccc cea eeg 3218 
His Gly Thr Ala Leu Pro Asp Ala Ala Asp Ala Glu Trp Pro Pro Pro 
1055 1060 1065 1070 

ggc gcg gtg ccc gcg gac ggg etg eeg ggt gtg tgg cgc egg ggg gac 3266 
Gly Ala Val Pro Ala Asp Gly Leu Pro Gly Val Trp Arg Arg Gly Asp 
1075 1080 1085 

cag gtc tte gee gag gee gag gtg gae gga eeg gac ggt tte gtg gtg 3 314 
Gin Val Phe Ala Glu Ala Glu Val Asp Gly Pro Asp Gly Phe Val Val 
1090 1095 1100 



cac ccc gac etg etc gae gcg gtc tte tec gcg gtc ggc gac gga age 
His Pro Asp Leu Leu Asp Ala Val Phe Ser Ala Val Gly Asp Gly Ser 
1105 1110 1115 
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cgc cag ccg gcc gga tgg cgc gac ctg acg gtg cac gcg teg gac gcc 
Arg Gin Pro Ala Gly Trp Arg Asp Leu Thr Val His Ala Ser Asp Ala 
1120 1125 1130 

acc gta ctg cgc gcc tgc etc acc egg egc ace gac gga gcc atg gga 
Thr Val Leu Arg Ala Cys Leu Thr Arg Arg Thr Asp Gly Ala Met Gly 
1135 1140 1145 1150 

ttc gee gee tte gac ggc gee ggc ctg ccg gta etc acc gcg gag gcg 
Phe Ala Ala Phe Asp Gly Ala Gly Leu Pro Val Leu Thr Ala Glu Ala 
1155 1160 1165 

gtg acg ctg egg gag gtg gcg tea ccg tee ggc tec gag gag teg gac 
Val Thr Leu Arg Glu Val Ala Ser Pro Ser Gly Ser Glu Glu Ser Asp 
1170 1175 1180 

ggc etg cac egg ttg gag tgg etc gcg gte gcc gag gcg gtc tac gac 
Gly Leu His Arg Leu Glu Trp Leu Ala Val Ala Glu Ala Val Tyr Asp 
1185 1190 1195 

gac ctg ccc gag gga eat gtc etg ate acc gee gee cac cec gac 
Gly Asp Leu Pro Glu Gly His Val Leu lie Thr Ala Ala His Pro Asp 
1200 1205 1210 

gae ccc gag gac ata cec ace egc gcc cac acc cgc gcc acc cgc gtc 
Asp Pro Glu Asp lie Pro Thr Arg Ala His Thr Arg Ala Thr Arg Val 
1215 1220 1225 1230 

ctg acc gcc ctg caa cac cac etc acc acc acc gac cac acc etc ate 
Leu Thr Ala Leu Gin His His Leu Thr Thr Thr Asp His Thr Leu lie 
1235 1240 1245 

gtc cac ace ace ace gac ccc gcc ggc gee acc gtc ace ggc etc acc 
Val His Thr Thr Thr Asp Pro Ala Gly Ala Thr Val Thr Gly Leu Thr 
1250 1255 1260 

cgc acc gee cag aac gaa cac cce cac cgc ate cgc etc ate gaa acc 
Arg Thr Ala Gin Asn Glu His Pro His Arg lie Arg Leu lie Glu Thr 
1265 1270 1275 

gac cac ccc cac ace ccc etc ccc etg gcc caa etc gee acc etc gac 
Asp His Pro His Thr Pro Leu Pro Leu Ala Gin Leu Ala Thr Leu Asp 
1280 1285 1290 

eae ccc cac etc cgc etc acc cac cac acc etc cac cac ccc cac etc 
His Pro His Leu Arg Leu Thr His His Thr Leu His His Pro His Leu 
1295 1300 1305 1310 

acc ccc etc cac ace ace acc eea cec ace acc acc cec etc aac cec 
Thr Pro Leu His Thr Thr Thr Pro Pro Thr Thr Thr Pro Leu Asn Pro 
1315 1320 1325 



gaa cac gcc ate ate ate acc ggc ggc tec ggc ace etc gee ggc ate 
Glu His Ala lie lie lie Thr Gly Gly Ser Gly Thr Leu Ala Gly lie 
1330 1335 1340 
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etc gcc cgc cac ctg aac cac ccc cac acc tac etc etc tec cgc acc 4082 
Leu Ala Arg His Leu Asn His Pro His Thr Tyr Leu Leu Ser Arg Thr 
1345 1350 1355 

cca ccc cec gac gcc acc ccc ggc acc cac etc ccc tgc gac gtc ggc 4130 
Pro Pro Pro Asp Ala Thr Pro Gly Thr His Leu Pro Cys Asp Val Gly 
1360 1365 1370 

gac ccc cac caa etc gcc acc acc etc acc cac ate ccc caa ccc etc 4178 
Asp Pro His Gin Leu Ala Thr Thr Leu Thr His lie Pro Gin Pro Leu 
1375 1380 1385 1390 



CI 
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acc gee ate tte cac acc gee gcc acc etc gac gac ggc ate etc cac 4226 
Thr Ala lie Phe His Thr Ala Ala Thr Leu Asp Asp Gly lie Leu His 
1395 . 1400 1405 

gcc etc acc ccc gac cgc etc ace ace gtc etc cac ccc aaa gcc aac 4274 
Ala Leu Thr Pro Asp Arg Leu Thr Thr Val Leu His Pro Lys Ala Asn 
1410 1415 1420 

gcc gcc tgg cac ctg cac cac etc ace caa aac caa ccc etc acc cac 4322 
Ala Ala Trp His Leu His His Leu Thr Gin Asn Gin Pro Leu Thr His 
1425 1430 1435 

tte gtc etc tac tec age gee gcc gcc gtc etc ggc age ccc gga caa 4370 
Phe Val Leu Tyr Ser Ser Ala Ala Ala Val Leu Gly Ser Pro Gly Gin 
1440 1445 . 1450 

gga aac tac gcc gee gee aac gcc tte etc gac gcc etc gee acc cac 4418 
Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala Thr His 



m 1455 1460 1465 1470 



cgc cac acc etc ggc caa ccc gcc acc tec ate gcc tgg ggc atg tgg 4466 
Arg His Thr Leu Gly Gin Pro Ala Thr Ser lie Ala Trp Gly Met Trp 
1475 1480 1485 

cac ace acc age ace etc acc gga caa etc gac gac gee gac egg gac 4514 
His Thr Thr Ser Thr Leu Thr Gly Gin Leu Asp Asp Ala Asp Arg Asp 
1490 1495 1500 

cgc ate cgc cgc ggc ggt tte etc ecg ate aeg gac gac gag ggc 4559 
Arg lie Arg Arg Gly Gly Phe Leu Pro lie Thr Asp Asp Glu Gly 
1505 1510 1515 

atggggatgc at 4571 



<210> 19 
<211> 1517 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 19 



Gin Leu Ala Glu 
1 

Val Leu Gly His 
20 

Lys Asp Leu Gly 
35 

Leu Thr Glu Ala 
50 

Phe Pro Thr Pro 
65 

Gly Thr Arg Ala 



His Asp Glu Pro 
100 

Gly Val Ala Ser 
115 

Asp Ala lie Thr 
130 

lie Tyr Asp Pro 
145 

Gly Gly Phe Leu 



lie Ser Pro Arg 
180 

Leu Glu Thr Ser 
195 

Ser Thr Arg Gly 
210 

Gly Tyr Gly Thr 
225 

Gin Thr Ser Val 



Gly Pro Ala Val 
260 

Leu His Gin Ala 
275 

Leu Val Gly Gly 
290 



Ala Leu Leu Thr 
5 

Val Gly Gly Glu 



lie Asp Ser Leu 
40 

Thr Gly Val Arg 
55 

His Val Leu Ala 
70 

Pro Val Val Pro 
85 

Leu Ala lie Val 



Pro Glu Glu Leu 
120 

Glu Phe Pro Thr 
135 

Asp Pro Asp Ala 

. 150 

Thr Gly Ala Thr 
165 

Glu Ala Leu Ala 



Trp Glu Ala Phe 
200 

Ser Asp Thr Gly 
215 

Gly Ala Asp Thr 
230 

Leu Ser Gly Arg 
245 

Thr Val Asp Thr 



Gly Gin Ser Leu 
280 

Val Thr Val Met 
295 
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Leu Val Arg Glu 
10 

Asp lie Pro Ala 
25 

Thr Ala Val Gin 



Leu Asn Ala Thr 
60 

Gly Lys Leu Gly 
75 

Arg Thr Ala Ala 
90 

Gly Met Ala Cys 
105 

Trp His Leu Val 



Asp Arg Gly Trp 
140 

lie Gly Lys Thr 
155 

Gly Phe Asp Ala 
170 

Met Asp Pro Gin 
185 

Glu Ser Ala Gly 



Val Phe Val Gly 
220 

Asp Gly Phe Gly 
235 

Leu Ser Tyr Phe 
250 

Ala Cys Ser Ser 
265 

Arg Ser Gly Glu 



Ala Ser Pro Gly 
300 



Ser Thr Ala Ala 
15 

Thr Ala Ala Phe 
30 

Leu Arg Asn Ala 
45 

Ala Val Phe Asp 



Asp Glu Leu Thr 
80 

Thr Ala Gly Ala 
95 

Arg Leu Pro Gly 
110 

Ala Ser Gly Thr 
125 

Asp Val Asp Ala 



Phe Val Arg His 
160 

Ala Phe Phe Gly 
175 

Gin Arg Val Leu 
190 

lie Thr Pro Asp 
205 

Ala Phe Ser Tyr 



Ala Thr Gly Ser 
240 

Tyr Gly Leu Glu 
255 

Ser Leu Val Ala 
270 

Cys Ser Leu Ala 
285 

Gly Phe Val Glu 



48 



^ 435 440 445 



Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Ala Lys Ala Phe 
305 310 315 320 

Gly Ala Gly Ala Asp Gly Thr Ser Phe Ala Glu Gly Ala Gly Val Leu 
325 330 335 

lie Val Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His Thr Val Leu 
340 345 350 

Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
355 360 365 

Leu Ser Ala Pro Asn Gly Pro Ser Gin Glu Arg Val lie Arg Gin Ala 
370 375 380 

Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp Val Asp Ala Val Glu Ala 
385 390 395 400 

His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Val 
□ 405 410 415 

Leu Ala Thr Tyr Gly Gin Glu Arg Ala Thr Pro Leu Leu Leu Gly Ser 
V 420 425 430 

Leu Lys Ser Asn lie Gly His Ala Gin Ala Ala Ser Gly Val Ala Gly 

or He He Lys Met Val Gin Ala Leu Arg His Gly Glu Leu Pro Pro Thr 
s 450 455 460 

Q 

^ Leu His Ala Asp Glu Pro Ser Pro His Val Asp Trp Thr Ala Gly Ala 
fij 465 470 475 480 

^ Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr Asp Arg Pro 

485 490 495 

Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser Gly Thr Asn Ala His 
500 505 510 

Val He Leu Glu Ser Ala Pro Pro Ala Gin Pro Ala Glu Glu Ala Gin 
515 520 525 

Pro Val Glu Thr Pro Val Val Ala Ser Asp Val Leu Pro Leu Val He 
530 535 540 

Ser Ala Lys Thr Gin Pro Ala Leu Thr Glu His Glu Asp Arg Leu Arg 
545 550 555 560 

Ala Tyr Leu Ala Ala Ser Pro Gly Ala Asp He Arg Ala Val Ala Ser 
565 570 575 

Thr Leu Ala Val Thr Arg Ser Val Phe Glu His Arg Ala Val Leu Leu 
580 585 590 



Gly Asp Asp Thr Val Thr Gly Thr Ala Val Thr Asp Pro Arg He Val 
595 600 605 
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Phe Val Phe Pro Gly Gin Gly Trp Gin Trp Leu Gly Met Gly Ser Ala 
610 615 620 

Leu Arg Asp Ser Ser Val Val Phe Ala Glu Arg Met Ala Glu Cys Ala 
625 630 635 640 

Ala Ala Leu Arg Glu Phe Val Asp Trp Asp Leu Phe Thr Val Leu Asp 
645 650 655 

Asp Pro Ala Val Val Asp Arg Val Asp Val Val Gin Pro Ala Ser Trp 
660 665 670 

Ala Met Met Val Ser Leu Ala Ala Val Trp Gin Ala Ala Gly Val Arg 
675 680 685 

Pro Asp Ala Val lie Gly His Ser Gin Gly Glu lie Ala Ala Ala Cys 
690 695 700 

Val Ala Gly Ala Val Ser Leu Arg Asp Ala Ala Arg lie Val Thr Leu 
705 710 715 720 

Arg Ser Gin Ala lie Ala Arg Gly Leu Ala Gly Arg Gly Ala Met Ala 
725 730 735 

Ser Val Ala Leu Pro Ala Gin Asp Val Glu Leu Val Asp Gly Ala Trp 
740 745 750 

lie Ala Ala His Asn Gly Pro Ala Ser Thr Val lie Ala Gly Thr Pro 
755 760 765 

Glu Ala Val Asp His Val Leu Thr Ala His Glu Ala Gin Gly Val Arg 
770 775 780 

Val Arg Arg lie Thr Val Asp Tyr Ala Ser His Thr Pro His Val Glu 
785 790 795 800 

Leu lie Arg Asp Glu Leu Leu Asp lie Thr Ser Asp Ser Ser Ser Gin 
805 810 815 

Thr Pro Leu Val Pro Trp Leu Ser Thr Val Asp Gly Thr Trp Val Asp 
820 825 830 

Ser Pro Leu Asp Gly Glu Tyr Trp Tyr Arg Asn Leu Arg Glu Pro Val 
835 840 845 

Gly Phe His Pro Ala Val Ser Gin Leu Gin Ala Gin Gly Asp Thr Val 
850 855 860 

Phe Val Glu Val Ser Ala Ser Pro Val Leu Leu Gin Ala Met Asp Asp 
865 870 875 880 

Asp Val Val Thr Val Ala Thr Leu Arg Arg -Asp Asp Gly Asp Ala Thr 
885 890 895 



Arg Met Leu Thr Ala Leu Ala Gin Ala Tyr Val His Gly Val Thr Val 
900 905 910 
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Asp Trp Pro Ala lie Leu Gly Thr Thr Thr Thr Arg Val Leu Asp Leu 
915 920 925 

Pro Thr Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu Glu Ser Ala Arg 
930 935 940 

Pro Ala Ala Ser Asp Ala Gly His Pro Val Leu Gly Ser Gly lie Ala 
945 950 955 960 

Leu Ala Gly Ser Pro Gly Arg Val Phe Thr Gly Ser Val Pro Thr Gly 
965 970 975 

Ala Asp Arg Ala Val Phe Val Ala Glu Leu Ala Leu Ala Ala Ala Asp 
980 985 990 

Ala Val Asp Cys Ala Thr Val Glu Arg- Leu Asp lie Ala Ser Val Pro 
995 1000 1005 

Gly Arg Pro Gly His Gly Arg Thr Thr Val Gin Thr Trp Val Asp Glu 
□ 1010 1015 1020 

Pro Ala Asp Asp Gly Arg Arg Arg Phe Thr Val His Thr Arg Thr Gly 
^ 025 1030 1035 1040 

f^. Asp Ala Pro Trp Thr Leu His Ala Glu Gly Val Leu Arg Pro His Gly 

1045 1050 1055 

ff^ Thr Ala Leu Pro Asp Ala Ala Asp Ala Glu Trp Pro Pro Pro Gly Ala 
s 1060 1065 1070 

Q 

^ Val Pro Ala Asp Gly Leu Pro Gly Val Trp Arg Arg Gly Asp Gin Val 
flj 1075 1080 1085 



a 



Phe Ala Glu Ala Glu Val Asp Gly Pro Asp Gly Phe Val Val His Pro 
1090 1095 1100 

Asp Leu Leu Asp Ala Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin 
105 1110 1115 1120 

Pro Ala Gly Trp Arg Asp Leu Thr Val His Ala Ser Asp Ala Thr Val 
1125 1130 1135 

Leu Arg Ala Cys Leu Thr Arg Arg Thr Asp Gly Ala Met Gly Phe Ala 
1140 1145 1150 

Ala Phe Asp Gly Ala Gly Leu Pro Val Leu Thr Ala Glu Ala Val Thr 
1155 1160 1165 

Leu Arg Glu Val Ala Ser Pro Ser Gly Ser Glu Glu Ser Asp Gly Leu 
1170 1175 1180 

His Arg Leu Glu Trp Leu Ala Val Ala Glu Ala Val Tyr Asp Gly Asp 
185 1190 1195 1200 



Leu Pro Glu Gly His Val Leu Tie Thr Ala Ala His Pro Asp Asp Pro 
1205 1210 1215 
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Glu Asp lie Pro Thr Arg Ala His Thr Arg Ala Thr Arg Val Leu Thr 
1220 1225 1230 

Ala Leu Gin His His Leu Thr Thr Thr Asp His Thr Leu lie Val His 
1235 1240 1245 

Thr Thr Thr Asp Pro Ala Gly Ala Thr Val Thr Gly Leu Thr Arg Thr 
1250 1255 1260 

Ala Gin Asn Glu His Pro His Arg lie Arg Leu lie Glu Thr Asp His 
265 1270 1275 1280 

Pro His Thr Pro Leu Pro Leu Ala Gin Leu Ala Thr Leu Asp His Pro 
1285 1290 1295 

His Leu Arg Leu Thr His His Thr Leu His His Pro His Leu Thr Pro 
1300 1305 1310 

Leu His Thr Thr Thr Pro Pro Thr Thr Thr Pro Leu Asn Pro Glu His 
1315 1320 1325 

Ala lie lie lie Thr isly Gly Ser Gly Thr Leu Ala Gly lie Leu Ala 
1330 1335 1340 

Arg His Leu Asn His Pro His Thr Tyr Leu Leu Ser Arg Thr Pro Pro 
345 1350 1355 1360 

Pro Asp Ala Thr Pro Gly Thr His Leu Pro Cys Asp Val Gly Asp Pro 
1365 1370 1375 



Q His Gin Leu Ala Thr Thr Leu Thr His lie Pro Gin Pro Leu Thr Ala 
'm 1380 1385 1390 



lie Phe His Thr Ala Ala Thr Leu Asp Asp Gly lie Leu His Ala Leu 
1395 1400 1405 

Thr Pro Asp Arg Leu Thr Thr Val Leu His Pro Lys Ala Asn Ala Ala 
1410 1415 1420 

Trp His Leu His His Leu Thr Gin Asn Gin Pro Leu Thr His Phe Val 
425 1430 1435 1440 

Leu Tyr Ser Ser Ala Ala Ala Val Leu Gly Ser Pro Gly Gin Gly Asn 
1445 1450 1455 

Tyr Ala Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala Thr His Arg His 
1460 1465 1470 

Thr Leu Gly Gin Pro Ala Thr Ser lie Ala Trp Gly Met Trp His Thr 
1475 1480 1485 

Thr Ser Thr Leu Thr Gly Gin Leu Asp Asp Ala Asp Arg Asp Arg lie 
1490 1495 1500 

Arg Arg Gly Gly Phe Leu Pro lie Thr Asp Asp Glu Gly 
505 1510 1515 
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<210> 20 
<211> 4466 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 

<221> CDS 

<222> (9) . . (4454) 



<400> 20 

agatctgg cag etc gcc gaa gcg ctg ctg acg etc gtc egg gag age ace 5 0 
Gin Leu Ala Glu Ala Leu Leu Thr Leu Val Arg Glu Ser Thr 
15 10 



gcc gcc gtg etc ggc cac gtg ggt gge gag gae ate cec gcg acg gcg 98 

Ala Ala Val Leu Gly His Val Gly Gly Glu Asp lie Pro Ala Thr Ala 
15 20 25 30 

gcg ttc aag gae etc ggc ate gac teg etc ace gcg gtc cag ctg cgc 146 

Ala Phe Lys Asp Leu Gly lie Asp Ser Leu Thr Ala Val Gin Leu Arg 
35 40 45 

aac gee etc ace gag gcg ace ggt gtg egg ctg aac gcc acg gcg gtc 194 

Asn Ala Leu Thr Glu Ala Thr Gly Val Arg Leu Asn Ala Thr Ala Val 

50 55 60 . : 



Q ttc gac ttc ccg ace ccg cac gtg etc gcc ggg aag etc ggc gae gaa 242 
Q] Phe Asp Phe Pro Thr Pro His Val Leu Ala Gly Lys Leu Gly Asp Glu 

ftj 70 75 

Pi ctg ace gge ace cgc gcg cec gtc gtg cec egg ace gcg gcc acg gee 2 90 

Leu Thr Gly Thr Arg Ala Pro Val Val Pro Arg Thr Ala Ala Thr Ala 
^ 80 85 90 



ggt gcg cac gac gag ccg ctg gcg ate gtg gga atg gcc tge egg ctg 33 8 
Gly Ala His Asp Glu Pro Leu Ala lie Val Gly Met Ala Cys Arg Leu 
95 100 105 110 



cec gge ggg gtc gcg tea cec gag gag ctg tgg cac etc gtg gea tee 3 86 
Pro Gly Gly Val Ala Ser Pro Glu Glu Leu Trp His Leu Val Ala Ser 
115 120 125 



gge ace gac gee ate acg gag ttc ccg acg gae cgc ggc tgg gae gtc 4 34 

Gly Thr Asp Ala lie Thr Glu Phe Pro Thr Asp Arg Gly Trp Asp Val 

130 135 140 

gac gcg ate tac gac ccg gae cec gac gcg ate ggc aag ace ttc gtc 4 82 

Asp Ala lie Tyr Asp Pro Asp Pro Asp Ala lie Gly Lys Thr Phe Val 
145 150 155 

egg cac ggt ggc ttc etc ace gge gcg aca ggc ttc gac gcg gcg ttc 53 0 

Arg His Gly Gly Phe Leu Thr Gly Ala Thr Gly Phe Asp Ala Ala Phe 

160 165 170 
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ttc ggc ate age eeg ege gag gee ete geg atg gac ccg cag cag egg 578 
Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
175 180 185 190 

gt§ cte ctg gag acg teg tgg gag geg tte gaa age gee gge ate ace 626 
Val Leu Leu Glu Thr Ser Trp Glu Ala Phe Glu Ser Ala Gly lie Thr 
195 200 205 

eeg gac teg ace ege gge age gae aee ggc gtg tte gte gge gee ttc 674 
Pro Asp Ser Thr Arg Gly Ser Asp Thr Gly Val Phe Val Gly Ala Phe 
210 215 220 

tee tac ggt tac gge aee ggt geg gac ace gac gge ttc gge geg aee 722 
Ser Tyr Gly Tyr Gly Thr Gly Ala Asp Thr Asp Gly Phe Gly Ala Thr 
225 230 235 

ggc teg cag aee agt gtg etc tec ggc egg ctg teg tac ttc tac ggt 770 
Gly Ser Gin Thr Ser Val Leu Ser Gly Arg Leu Ser Tyr Phe Tyr Gly 
240 245 250 

ctg gag ggt ccg geg gte acg gte gac acg geg tgt teg teg teg ctg 818 
Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu 
255 260 265 270 



f gtg geg ctg cac cag gcc ggg cag teg ctg ege tec gge gaa tgc teg 866 

Val Ala Leu His Gin Ala Gly Gin Ser Leu Arg Ser Gly Glu Cys Ser 

f-^ 275 280 285 

01 ete gee ctg gte gge ggc gte acg gtg atg geg tet ccc gge ggc ttc 914 

5 Leu Ala Leu Val Gly Gly Val Thr Val Met Ala Ser Pro Gly Gly . Phe 

Q 290 295 300 

m 

f|J 9tg gag ttc tec egg cag ege ggc etc geg ccg. gac gge egg geg aag 962 

l^^ Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Ala Lys 

305 310 315 

geg ttc ggc geg ggt geg gae gge acg age ttc gee gag ggt gee ggt 1010 

Ala Phe Gly Ala Gly Ala Asp Gly Thr Ser Phe Ala Glu Gly Ala Gly 
320 325 330 

gtg ctg ate gte gag agg etc tec gac gee gaa ege aac ggt cac ace 1058 

Val Leu lie Val Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His Thr 
335 340 345 350 

gte ctg geg gte gte cgt ggt teg geg gte aac cag gat ggt gee tec 1106 

Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser 

355 360 365 

aac ggg ctg teg geg eeg aac ggg ccg teg cag gag egg gtg ate egg 1154 

Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gin Glu Arg Val lie Arg 

370 375 380 

cag gcc ctg gee aac gee ggg etc ace ccg geg gac gtg gac gcc gte 1202 

Gin Ala Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp Val Asp Ala Val 

385 390 395 
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gag gcc cac ggc acc ggc acc agg ctg ggc gac ccc ate gag gca cag 125 0 
Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin 
400 405 410 

gcg gta ctg gcc acc tac gga cag gag cgc gcc acc ccc ctg ctg ctg 1298 
Ala Val Leu Ala Thr Tyr Gly Gin Glu Arg Ala Thr Pro Leu Leu Leu 
415 420 425 430 

ggc teg ctg aag tec aac ate ggc cac gcc cag gcc gcg tec ggc gtc 1346 
Gly Ser Leu Lys Ser Asn lie Gly His Ala Gin Ala Ala Ser Gly Val 
435 440 445 

gcc ggc ate ate aag atg gtg cag gcc etc egg cac ggg gag ctg eeg 13 94 
Ala Gly lie lie Lys Met Val Gin Ala Leu Arg His Gly Glu Leu Pro 
450 455 460 

ceg aeg ctg cac gee gac gag eeg teg ccg cac gtc gac tgg acg gcc 1442 
Pro Thr Leu His Ala Asp Glu Pro Ser Pro His Val Asp Trp Thr Ala 
465 470 475 

ggc gcc gtc gaa ctg ctg aeg teg gee egg ccg tgg ccc gag acc gac 1490 
Gly Ala Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr Asp 
480 485 490 

egg cea egg egt gee gee gtc tec teg tte ggg gtg age ggc ace aac 153 8 
Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Val Ser Gly Thr Asn 
495 500 505 510 



«_ gee cac gtc ate ctg gag gcc gga eeg gta aeg gag aeg ccc gcg gca 1586 

p Ala His Val lie Leu Glu Ala Gly Pro Val Thr Glu Thr Pro Ala Ala 
09 515 520 525 

m 

\j teg cct tec ggt gac ett ccc ctg ctg gtg teg gca cgc tea ccg gaa 1634 

Q Ser Pro Ser Gly Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro Glu 
rr 530 535 540 

gcg etc gac gag cag ate cgc ega ctg cgc gee tac ctg gac acc ace 1682 

Ala Leu Asp Glu Gin lie Arg Arg Leu Arg Ala Tyr Leu Asp Thr Thr 

545 550 555 

ceg gac gtc gac egg gtg gcc gtg gca cag acg ctg gcc egg cgc aca 173 0 

Pro Asp Val Asp Arg Val Ala Val Ala Gin Thr Leu Ala Arg Arg Thr 
560 565 570 

cac tte gcc cac cgc gee gtg ctg etc ggt gac ace gtc ate acc aca 1778 

His Phe Ala His Arg Ala Val Leu Leu Gly Asp Thr Val lie Thr Thr 
575 580 585 590 

ccc ccc gcg gac egg ccc gac gaa etc gtc tte gtc tac tec ggc cag 1826 

Pro Pro Ala Asp Arg Pro Asp Glu Leu Val Phe Val Tyr Ser Gly Gin 
595 600 605 



ggc ace cag cat ccc gcg atg ggc gag cag eta gee gcc gcg tte ccc 
Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Ala Ala Phe Pro 
610 615 620 



1874 
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gtc ttc gcg egg ate cat cag cag gtg tgg gac ctg etc gat gtg eee 1922 
Val Phe Ala Arg lie His Gin Gin Val Trp Asp Leu Leu Asp Val Pro 
625 630 635 

gat ctg gag gtg aac gag ace ggt tac gee cag ccg gee ctg ttc gca 1970 
Asp Leu Glu Val Asn Glu Thr Gly Tyr Ala Gin Pro Ala Leu Phe Ala 
640 645 650 

atg cag gtg get ctg ttc ggg ctg ctg gaa teg tgg ggt gta cga ccg 2018 
Met Gin Val Ala Leu Phe Gly Leu Leu Glu Ser Trp Gly Val Arg Pro 
655 660 665 670 

gac gcg gtg ate ggc cat teg gtg ggt gag ett gcg get gcg tat gtg 2066 
Asp Ala Val lie Gly His Ser Val Gly Glu Leu Ala Ala Ala Tyr Val 
675 680 685 

tec ggg gtg tgg teg ttg gag gat gee tgc act ttg gtg teg gcg egg 2114 
Ser Gly Val Trp Ser Leu Glu Asp Ala Cys Thr Leu Val Ser Ala Arg 
690 695 700 

get cgt ctg atg cag get ctg cec gcg ggt ggg gtg atg gtc get gtc 2162 
Ala Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Val Met Val Ala Val 
705 710 715 

ccg gtc teg gag gat gag gee egg gee gtg ctg ggt gag ggt gtg gag 2210 
Pro Val Ser Glu Asp Glu Ala Arg Ala Val Leu Gly Glu Gly Val Glu 
720 725 730 



- ate gee gcg gtc aac ggc ccg teg teg gtg gtt etc tec ggt gat gag 2258 
p lie Ala Ala Val Asn Gly Pro Ser Ser Val Val Leu Ser Gly Asp Glu 
fn 735 740 745 750 



gee gee gtg ctg cag gee gcg gag ggg ctg ggg aag tgg acg egg ctg 2 3 06 
Ala Ala Val Leu Gin Ala Ala Glu Gly Leu Gly Lys Trp Thr Arg Leu 
755 760 765 

gcg ace age cae gcg ttc cat tec gee cgt atg gaa cec atg ctg gag 2354 
Ala Thr Ser His Ala Phe His Ser Ala Arg Met Glu Pro Met Leu Glu 
770 775 780 

gag ttc egg gcg gtc gee gaa ggc ctg ace tac egg acg ccg cag gtc 2402 
Glu Phe Arg Ala Val Ala Glu Gly Leu Thr Tyr Arg Thr Pro Gin Val 
785 790 795 

tec atg gee gtt ggt gat cag gtg ace ace get gag tac tgg gtg egg 2450 
Ser Met Ala Val Gly Asp Gin Val Thr Thr Ala Glu Tyr Trp Val Arg 
800 805 810 

cag gtc egg gac acg gtc egg ttc ggc gag cag gtg gee teg tac gag 24 98 
Gin Val Arg Asp Thr Val Arg Phe Gly Glu Gin Val Ala Ser Tyr Glu 
815 820 825 830 

gac gee gtg ttc gtc gag ctg ggt gee gac egg tea ctg gee cgc ctg 2546 
Asp Ala Val Phe Val Glu Leu Gly Ala Asp Arg Ser Leu Ala Arg Leu 
835 840 * 845 
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gtc gac ggt gtc gcg atg ctg cac ggc gac cac gaa ate cag gcc gcg 2594 
Val Asp Gly Val Ala Met Leu His Gly Asp His Glu lie Gin Ala Ala 
850 855 860 

ate ggc gcc ctg gcc cac ctg tat gtc aac ggc gtc acg gtc gac tgg 2642 
lie Gly Ala Leu Ala His Leu Tyr Val Asn Gly Val Thr Val Asp Trp 
865 870 875 

ccc gcg etc ctg ggc gat get ccg gea aca egg gtg ctg gac ctt ccg 2690 
Pro Ala Leu Leu Gly Asp Ala Pro Ala Thr Arg Val Leu Asp Leu Pro 
880 885 890 

aca tac gcc ttc cag cac cag cge tac tgg etc gag teg gea ege ccg 273 8 
Thr Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu Glu Ser Ala Arg Pro 
895 900 905 910 

gee gea tec gac gcg ggc cac ccc gtg ctg ggc tec ggt ate gcc etc 2786 
Ala Ala Ser Asp Ala Gly His Pro Val Leu Gly Ser Gly lie Ala Leu 
915 920 925 

gcc ggg teg ccg ggc egg gtg ttc acg ggt tec gtg ccg ace ggt gcg 2834 
Ala Gly Ser Pro Gly Arg Val Phe Thr Gly Ser Val Pro Thr Gly Ala 
930 935 940 

gac cge gcg gtg ttc gtc gcc gag ctg gcg ctg gee gee gcg gac gcg 2 8 82 
Asp Arg Ala Val Phe Val Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala 
945 950 955 

gtc gac tge gee acg gtc gag egg etc gac ate gee tec gtg ccc ggc 2930 
Val Asp Cys Ala Thr Val Glu Arg Leu Asp lie Ala Ser Val Pro Gly 
960 965 970 

egg ccg ggc cat ggc egg acg ace gta cag ace tgg gtc gac gag ccg 2 978 
Arg Pro Gly His Gly Arg Thr Thr Val Gin Thr Trp Val Asp Glu Pro 
975 980 985 990 

gcg gac gac ggc egg cge egg ttc ace gtg cae ace cge ace ggc gac 3 026 
Ala Asp Asp Gly Arg Arg Arg Phe Thr Val His Thr Arg Thr Gly Asp 
995 1000 1005 

gcc ccg tgg acg ctg cac gee gag ggg gtg ctg cge ccc eat ggc aeg 3 074 
Ala Pro Trp Thr Leu His Ala Glu Gly Val Leu Arg Pro His Gly Thr 
1010 1015 1020 

gcc ctg ccc gat gcg gee gac gee gag tgg ccc cea ccg ggc gcg gtg 3122 
Ala Leu Pro Asp Ala Ala Asp Ala Glu Trp Pro Pro Pro Gly Ala Val 
1025 1030 1035 

ccc gcg gac ggg ctg ccg ggt gtg tgg cge egg ggg gac cag gtc ttc 3170 
Pro Ala Asp Gly Leu Pro Gly Val Trp Arg Arg Gly Asp Gin Val Phe 
1040 1045 1050 

gee gag gee gag gtg gac gga ccg gac ggt ttc gtg gtg cac ccc gac 3218 
Ala Glu Ala Glu Val Asp Gly Pro Asp Gly Phe Val Val His Pro Asp 
1055 1060 1065 1070 
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ctg etc gac gcg gtc ttc tec gcg gtc ggc gac gga age cgc cag eeg 3266 
Leu Leu Asp Ala Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro 
1075 1080 1085 

gee gga tgg cgc gac ctg acg gtg cac gcg teg gac gee acc gta ctg 3314 
Ala Gly Trp Arg Asp Leu Thr Val His Ala Ser Asp Ala Thr Val Leu 
1090 1095 1100 

cgc gcc tgc etc acc egg cgc ace gac gga gee atg gga ttc gcc gee 3362 
Arg Ala Cys Leu Thr Arg Arg Thr Asp Gly Ala Met Gly Phe Ala Ala 
1105 1110 1115 

ttc gac ggc gcc ggc ctg eeg gta etc acc gcg gag gcg gtg acg ctg 3410 
Phe Asp Gly Ala Gly Leu Pro Val Leu Thr Ala Glu Ala Val Thr Leu 
1120 1125 1130 

egg gag gtg gcg tea ccg tec ggc tee gag gag teg gac ggc ctg cac 3458 
Arg Glu Val Ala Ser Pro Ser Gly Ser Glu Glu Ser Asp Gly Leu His 
1135 1140 1145 1150 

egg ttg gag tgg etc gcg gtc gcc gag gcg gtc tac gac ggt gac ctg 3 506 
Arg Leu Glu Trp Leu Ala Val Ala Glu Ala Val Tyr Asp Gly Asp Leu 
1155 1160 1165 

ecc gag gga cat gtc ctg ate acc gee gcc cae eee gac gac cec gag 3554 
Pro Glu Gly His Val Leu lie Thr Ala Ala His Pro Asp Asp Pro Glu 
1170 1175 1180 

gac ata cec acc cgc gee cae acc cgc gee ace cgc gtc ctg acc gcc 3602 
Asp lie Pro Thr Arg Ala His Thr Arg Ala Thr Arg Val Leu Thr Ala 
1185 1190 1195 

ctg caa cac cac etc ace ace acc gac cac acc etc ate gtc cae acc 3650 
Leu Gin His His Leu Thr Thr Thr Asp His Thr Leu lie Val His Thr 
1200 1205 1210 

acc acc gac cec gcc ggc gee acc gtc acc ggc etc acc cgc acc gee 3 698 
Thr Thr Asp Pro Ala Gly Ala Thr Val Thr Gly Leu Thr Arg Thr Ala 
1215 1220 1225 1230 

cag aac gaa cac ecc cac cgc ate cgc etc ate gaa acc gac cae cec 3746 
Gin Asn Glu His Pro His Arg lie Arg Leu lie Glu Thr Asp His Pro 
1235 1240 1245 

cac ace cec etc cec ctg gcc caa etc gee ace etc gac cac cec cac 3794 
His Thr Pro Leu Pro Leu Ala Gin Leu Ala Thr Leu Asp His Pro His 
1250 1255 1260 

etc cgc etc acc cac cae acc etc eac cac ecc cac etc ace cec etc 3842 
Leu Arg Leu Thr His His Thr Leu His His Pro His Leu Thr Pro Leu 
1265 1270 1275 



cae acc ace acc cca ecc acc acc acc cec etc aac cec gaa cac gee 
His Thr Thr Thr Pro Pro Thr Thr Thr Pro Leu Asn Pro Glu His Ala 
1280 1285 1290 



3890 
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ate ate ate acc gge ggc tec ggc aec etc gee ggc ate etc gee cgc 393 8 
lie lie lie Thr Gly Gly Ser Gly Thr Leu Ala Gly lie Leu Ala Arg 
1295 1300 1305 1310 

eac ctg aac eac cce cae aec tae etc etc tec cgc ace cea cec ecc 3986 
His Leu Asn His Pro His Thr Tyr Leu Leu Ser Arg Thr Pro Pro Pro 
1315 1320 1325 

gac gee acc cce ggc acc eac etc cce tgc gac gtc gge gac cce eac 4034 
Asp Ala Thr Pro Gly Thr His Leu Pro Cys Asp Val Gly Asp Pro His 
1330 1335 1340 

caa etc gee acc acc etc acc cae ate cce caa cec etc acc gee ate 4082 
Gin Leu Ala Thr Thr Leu Thr His lie Pro Gin Pro Leu Thr Ala lie 
1345 1350 1355 

ttc eac acc gee gee aec etc gac gac gge ate etc cae gee etc ace 4130 
Phe His Thr Ala Ala Thr Leu Asp Asp Gly lie Leu His Ala Leu Thr 
1360 1365 1370 

cce gac cgc etc ace ace gtc etc cae cce aaa gee aac gee gee tgg 4178 
'^^^ "^^^ Pro Lys Ala Asn Ala Ala Trp 



1375 1380 1385 



1390 



eac ctg cae eac etc ace caa aac caa cec etc acc cae ttc gtc etc 4226 
His Leu His His Leu Thr Gin Asn Gin Pro Leu Thr His Phe Val Leu 
1395 1400 1405 



~™ "^^^ "39^ cec gga caa gga aac tae 4274 

Q Tyr Ser Ser Ala Ala Ala Val Leu Gly Ser Pro Gly Gin Gly Asn Tyr 
^ 1410 1415 1420 

m 

%l 9CC gee gee aac gee ttc etc gac gee etc gee ace cae cgc cae acc 4322 

^ Ala Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala Thr His Arg His Thr 
1425 1430 1435 

etc ggc caa cec gee aec tec ate gee tgg ggc atg tgg eac acc acc 4370 
Leu Gly Gin Pro Ala Thr Ser He Ala Trp Gly Met Trp His Thr Thr 
1440 1445 1450 

age acc etc acc gga caa etc gac gac gee gac egg gac cgc ate cgc 4418 
Ser Thr Leu Thr Gly Gin Leu Asp Asp Ala Asp Arg Asp Arg He Arg 
1455 1460 1465 1470 

cgc ggc ggt ttc etc ceg ate aeg gac gac gag ggc atggggatgc at 4466 
Arg Gly Gly Phe Leu Pro He Thr Asp Asp Glu Gly 
1475 1480 



<210> 21 
<211> 1482 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 
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<400> 21 
Gin Leu Ala Glu 
1 

Val Leu Gly His 
20 

Lys Asp Leu Gly 
35 

Leu Thr Glu Ala 
50 

Phe Pro Thr Pro 
65 

Gly Thr Arg Ala 



His Asp Glu Pro 
100 

Gly Val Ala Ser 
115 

Asp Ala lie Thr 
130 

lie Tyr Asp Pro 
145 

Gly Gly Phe Leu 



lie Ser Pro Arg 
180 

Leu Glu Thr Ser 
195 

Ser Thr Arg Gly 
210 

Gly Tyr Gly Thr 
225 

Gin Thr Ser Val 



Gly Pro Ala Val 
260 

Leu His Gin Ala 
275 

Leu Val Gly Gly 
290 



Ala Leu Leu Thr 
5 

Val Gly Gly Glu 



lie Asp Ser Leu 
40 

Thr Gly Val Arg 
55 

His Val Leu Ala 
70 

Pro Val Val Pro 
85 

Leu Ala lie Val 



Pro Glu Glu Leu 
120 

Glu Phe Pro Thr 
135 

Asp Pro Asp Ala , 
150 

Thr Gly Ala Thr 
165 

Glu Ala Leu Ala 



Trp Glu Ala Phe 
200 

Ser Asp Thr Gly 
215 

Gly Ala Asp Thr 
230 

Leu Ser Gly Arg 
245 

Thr Val Asp Thr 



Gly Gin Ser Leu 
280 

Val Thr Val Met 
295 



Leu Val Arg Glu 
10 

Asp lie Pro Ala 
25 

Thr Ala Val Gin 



Leu Asn Ala Thr 
60 

Gly Lys Leu Gly 
75 

Arg Thr Ala Ala 
90 

Gly Met Ala Cys 
105 

Trp His Leu Val 



Asp Arg Gly Trp 
140 

lie Gly Lys Thr 
155 

Gly Phe Asp Ala 
170 

Met Asp Pro Gin 
185 

Glu Ser Ala Gly 



Val Phe Val Gly 
220 

Asp Gly Phe Gly 
235 

Leu Ser Tyr Phe 
250 

Ala Cys Ser Ser 
265 

Arg Ser Gly Glu 



Ala Ser Pro Gly 
300 



Ser Thr Ala Ala 
15 

Thr Ala Ala Phe 
30 

Leu Arg Asn Ala 
45 

Ala Val Phe Asp 



Asp Glu Leu Thr 
80 

Thr Ala Gly Ala 
95 

Arg Leu Pro Gly 
110 

Ala Ser Gly Thr 
125 

Asp Val Asp Ala 



Phe Val Arg His 
160 

Ala Phe Phe Gly 
175 

Gin Arg Val Leu 
190 

lie Thr Pro Asp 
205 

Ala Phe Ser Tyr 



Ala Thr Gly Ser 
240 

Tyr Gly Leu Glu 
255 

Ser Leu Val Ala 
270 

Cys Ser Leu Ala 
285 

Gly Phe Val Glu 
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Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Ala Lys Ala Phe 
305 310 315 320 

Gly Ala Gly Ala Asp Gly Thr Ser Phe Ala Glu Gly Ala Gly Val Leu 
325 330 335 

lie Val Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His Thr Val Leu 
340 345 350 

Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
355 360 365 

Leu Ser Ala Pro Asn Gly Pro Ser Gin Glu Arg Val lie Arg Gin Ala 
370 375 380 

Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp Val Asp Ala Val Glu Ala 
385 390 395 400 

His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Val 
405 410 415 

Leu Ala Thr Tyr Gly Gin Glu Arg Ala Thr Pro Leu Leu Leu Gly Ser 
420 425 430 

Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ser Gly Val Ala Gly 
435 440 445 

He lie. Lys Met Val Gin Ala Leu Arg His Gly Glu Leu Pro Pro Thr 
450 455 460 . . 

Leu His Ala Asp Glu Pro Ser Pro His Val Asp Trp Thr Ala Gly Ala 
465 470 475 480 

Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr Asp Arg Pro 
485 490 495 

Arg Arg Ala Ala Val Ser Ser Phe Gly Val Ser Gly Thr Asn Ala His 
500 505 510 

Val He Leu Glu Ala Gly Pro Val Thr Glu Thr Pro Ala Ala Ser Pro 
515 520 525 

Ser Gly Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro Glu Ala Leu 
530 535 540 

Asp Glu Gin He Arg Arg Leu Arg Ala Tyr Leu Asp Thr Thr Pro Asp 
545 550 555 560 

Val Asp Arg Val Ala Val Ala Gin Thr Leu Ala Arg Arg Thr His Phe 
565 570 575 

Ala His Arg Ala Val Leu Leu Gly Asp Thr Val He Thr Thr Pro Pro 
580 585 590 

Ala Asp Arg Pro Asp Glu Leu Val Phe Val Tyr Ser Gly Gin Gly Thr 
595 600 605 
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Gin His Pro Ala Met Gly Glu Gin Leu Ala Ala Ala Phe Pro Val Phe 
610 615 620 

Ala Arg lie His Gin Gin Val Trp Asp Leu Leu Asp Val Pro Asp Leu 
625 630 635 640 

Glu Val Asn Glu Thr Gly Tyr Ala Gin Pro Ala Leu Phe Ala Met Gin 
645 650 655 

Val Ala Leu Phe Gly Leu Leu Glu Ser Trp Gly Val Arg Pro Asp Ala 
660 665 670 

Val lie Gly His Ser Val Gly Glu Leu Ala Ala Ala Tyr Val Ser Gly 
675 680 685 

Val Trp Ser Leu Glu Asp Ala Cys Thr Leu Val Ser Ala Arg Ala Arg 
690 695 700 

Leu Met Gin Ala Leu Pro Ala Gly Gly Val Met Val Ala Val Pro Val 
705 710 715 720 

Ser Glu Asp Glu Ala Arg Ala Val Leu Gly Glu Gly Val Glu lie Ala 
725 730 735 

Ala Val Asn Gly Pro Ser Ser Val Val Leu Ser Gly Asp Glu Ala Ala 
740 745 750 

Val Leu Gin Ala Ala Glu Gly Leu Gly Lys Trp Thr Arg Leu Ala Thr 
755 760 765 

Ser His Ala Phe His Ser Ala Arg Met Glu Pro Met Leu Glu Glu Phe 
770 775 780 

Arg Ala Val Ala Glu Gly Leu Thr Tyr Arg Thr Pro Gin Val Ser Met 
785 790 795 800 

Ala Val Gly Asp Gin Val Thr Thr Ala Glu Tyr Trp Val Arg Gin Val 
805 810 815 

Arg Asp Thr Val Arg Phe Gly Glu Gin Val Ala Ser Tyr Glu Asp Ala 
820 825 830 

Val Phe Val Glu Leu Gly Ala Asp Arg Ser Leu Ala Arg Leu Val Asp 
835 840 845 

Gly Val Ala Met Leu His Gly Asp His Glu lie Gin Ala Ala lie Gly 
850 855 860 

Ala Leu Ala His Leu Tyr Val Asn Gly Val Thr Val Asp Trp Pro Ala 
865 870 875 880 

Leu Leu Gly Asp Ala Pro Ala Thr Arg Val Leu Asp Leu Pro Thr Tyr 
885 890 895 

Ala Phe Gin His Gin Arg Tyr Trp Leu Glu Ser Ala Arg Pro Ala Ala 
900 905 910 
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Ser Asp Ala Gly His Pro Val Leu Gly Ser Gly lie Ala Leu Ala Gly 
915 920 925 

Ser Pro Gly Arg Val Phe Thr Gly Ser Val Pro Thr Gly Ala Asp Arg 
930 935 940 

Ala Val Phe Val Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Val Asp 
945 950 955 960 

Cys Ala Thr Val Glu Arg Leu Asp lie Ala Ser Val Pro Gly Arg Pro 
965 970 975 

Gly His Gly Arg Thr Thr Val Gin Thr Trp Val Asp Glu Pro Ala Asp 
980 985 990 

Asp Gly Arg Arg Arg Phe Thr Val His Thr Arg Thr Gly Asp Ala Pro 
995 1000 1005 

Trp Thr Leu His Ala Glu Gly Val Leu Arg Pro His Gly Thr Ala Leu 
1010 1015 1020 

Pro Asp Ala Ala Asp Ala Glu Trp Pro Pro Pro Gly Ala Val Pro Ala 
025 1030 1035 1040 

Asp Gly Leu Pro Gly Val Trp Arg Arg Gly Asp Gin Val Phe Ala Glu 
1045 1050 1055 

Ala Glu Val Asp Gly Pro Asp Gly Phe Val Val His Pro Asp Leu Leu 
1060 1065 1070 

Asp Ala Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Ala Gly 
1075 1080 1085 

Trp Arg Asp Leu Thr Val His Ala Ser Asp Ala Thr Val Leu Arg Ala 
1090 1095 1100 

Cys Leu Thr Arg Arg Thr Asp Gly Ala Met Gly Phe Ala Ala Phe Asp 
105 1110 1115 1120 



Gly Ala Gly Leu Pro 
1125 

Val Ala Ser Pro Ser 
1140 

Glu Trp Leu Ala Val 
1155 

Gly His Val Leu lie 
1170 

Pro Thr Arg Ala His 
185 



Val Leu Thr Ala Glu 
1130 

Gly Ser Glu Glu Ser 
1145 

Ala Glu Ala Val Tyr 
1160 

Thr Ala Ala His Pro 
1175 

Thr Arg Ala Thr Arg 
190 



Ala Val Thr Leu Arg Glu 
1135 

Asp Gly Leu His Arg Leu 
1150 

Asp Gly Asp Leu Pro Glu 
1165 

Asp Asp Pro Glu Asp lie 
1180 

Val Leu Thr Ala Leu Gin 
.195 1200 



His His Leu Thr Thr Thr Asp His Thr Leu lie Val His Thr Thr Thr 
1205 1210 1215 
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Asp Pro Ala Gly Ala Thr Val Thr Gly Leu Thr Arg Thr Ala Gin Asn 
1220 1225 1230 

Glu His Pro His Arg lie Arg Leu He Glu Thr Asp His Pro His Thr 
1235 1240 1245 

Pro Leu Pro Leu Ala Gin Leu Ala Thr Leu Asp His Pro His Leu Arg 
1250 1255 1260 

Leu Thr His His Thr Leu His His Pro His Leu Thr Pro Leu His Thr 
265 1270 1275 1280 

Thr Thr Pro Pro Thr Thr Thr Pro Leu Asn Pro Glu His Ala He He 
1285 1290 1295 

He Thr Gly Gly Ser Gly Thr Leu Ala Gly He Leu Ala Arg His Leu 
1300 1305 1310 

Asn His Pro His Thr Tyr Leu Leu Ser Arg Thr Pro Pro Pro Asp Ala 
1315 1320 1325 

Thr Pro Gly Thr His Leu Pro Cys Asp Val Gly Asp Pro His Gin Leu 
1330 1335 1340 

Ala Thr Thr Leu Thr His He Pro Gin Pro Leu Thr Ala He Phe His 
345 1350 1355 1360 

Thr Ala Ala Thr Leu Asp Asp Gly He Leu His Ala Leu Thr Pro Asp 
1365 1370 1375 

Arg Leu Thr Thr Val Leu His Pro Lys Ala Asn Ala Ala Trp His Leu 
1380 1385 1390 

His His Leu Thr Gin Asn Gin Pro Leu Thr His Phe Val Leu Tyr Ser 
1395 1400 1405 

Ser Ala Ala Ala Val Leu Gly Ser Pro Gly Gin Gly Asn Tyr Ala Ala 
1410 1415 1420 

Ala Asn Ala Phe Leu Asp Ala Leu Ala Thr His Arg His Thr Leu Gly 
425 -1430 1435 1440 

Gin Pro Ala Thr Ser He Ala Trp Gly Met Trp His Thr Thr Ser Thr 
1445 1450 1455 

Leu Thr Gly Gin Leu Asp Asp Ala Asp Arg Asp. Arg He Arg Arg Gly 
1460 1465 1470 

Gly Phe Leu Pro He Thr Asp Asp Glu Gly 
1475 1480 



<210> 22 
<211> 4547 
<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 

<221> CDS 

<222> (9) . . (4535) 

<400> 22 

agatctgg cag etc gcc gaa gcg ctg ctg acg etc gtc egg gag age ace 
Gin Leu Ala Glu Ala Leu Leu Thr Leu Val Arg Glu Ser Thr 
15 10 

gcc gcc gtg etc ggc cac gtg ggt ggc gag gac ate cec gcg acg gcg 
Ala Ala Val Leu Gly His Val Gly Gly Glu Asp lie Pro Ala Thr Ala 
15 20 25 30 

gcg ttc aag gac etc ggc ate gac teg etc acc gcg gtc cag ctg cge 
Ala Phe Lys Asp Leu Gly lie Asp Ser Leu Thr Ala Val Gin Leu Arg 
35 40 45 

aac gcc etc ace gag gcg acc ggt gtg egg ctg aae gcc acg gcg gtc 
Asn Ala Leu Thr Glu Ala Thr Gly Val Arg Leu Asn Ala Thr Ala Val 
50 55 60 

ttc gac ttc ccg acc eeg cac gtg etc gcc ggg aag etc ggc gac gaa 
Phe Asp Phe Pro Thr Pro His Val Leu Ala Gly Lys Leu Gly Asp Glu 
65 70 75 

ctg ace ggc ace cgc gcg cec gtc gtg cec egg acc gcg gcc acg gcc 
Leu Thr Gly Thr Arg Ala Pro Val Val Pro Arg Thr Ala Ala Thr Ala 
80 85 90 

ggt gcg cac gac gag ccg ctg gcg ate gtg gga atg gcc tgc egg ctg 
Gly Ala His Asp Glu Pro Leu Ala lie Val Gly Met Ala Cys Arg Leu 
95 100 105 110 

cec ggc ggg gtc gcg tea cec gag gag ctg tgg cac etc gtg gca tee 
Pro Gly Gly Val Ala Ser Pro Glu Glu Leu Trp His Leu Val Ala Ser 
115 120 125 

ggc ace gac gee ate acg gag ttc ccg acg gac cgc ggc tgg gac gtc 
Gly Thr Asp Ala lie Thr Glu Phe Pro Thr Asp Arg Gly Trp Asp Val 
130 135 140 

gac gcg ate tac gac ccg gac cec gac gcg ate ggc aag ace ttc gtc 
Asp Ala lie Tyr Asp Pro Asp Pro Asp Ala lie Gly Lys Thr Phe Val 
145 150 155 

egg cac ggt ggc ttc etc ace ggc gcg aea ggc ttc gac gcg gcg ttc 
Arg His Gly Gly Phe Leu Thr Gly Ala Thr Gly Phe Asp Ala Ala Phe 
160 165 170 

ttc ggc ate age ccg cgc gag gee etc gcg atg gac ccg cag cag egg 
Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg 
175 180 185 190 
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gtg etc ctg gag acg teg tgg gag gcg ttc gaa age gee gge ate ace 626 
Val Leu Leu Glu Thr Ser Trp Glu Ala Phe Glu Ser Ala Gly lie Thr 
195 200 205 

ccg gac teg ace cgc ggc age gae ace gge gtg ttc gte gge gee ttc 674 
Pro Asp Ser Thr Arg Gly Ser Asp Thr Gly Val Phe Val Gly Ala Phe 
210 215 220 

tec tae ggt tae ggc ace ggt gcg gac acc gac ggc ttc ggc gcg ace 722 
Ser Tyr Gly Tyr Gly Thr Gly Ala Asp Thr Asp Gly Phe Gly Ala Thr 
225 230 235 

gge teg cag acc agt gtg etc tec ggc egg ctg teg tae ttc tae ggt 770 
Gly Ser Gin Thr Ser Val Leu Ser Gly Arg Leu Ser Tyr Phe Tyr Gly 
240 245 250 

ctg gag ggt ccg gcg gte acg gte gac acg gcg tgt teg teg teg ctg 818 
Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu 
255 260 265 270 

gtg gcg ctg cac cag gee ggg cag teg ctg cgc tec gge gaa tgc teg 866 
Val Ala Leu His Gin Ala Gly Gin Ser Leu Arg Ser Gly Glu Cys Ser 
275 280 285 



^ . etc gee ctg gte gge gge gte acg gtg atg gcg tct ccc ggc gge ttc 914 
"^^^ Leu Ala Leu Val Gly Gly Val Thr Val Met Ala Ser Pro Gly Gly Phe 
^ 290 295 300 

m . . . ■ . 

gtg gag ttc tec egg cag cgc gge etc gcg ccg gae gge egg gcg aag 962 
Q Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Ala Lys 
fQ 305 310 315 

hi 

gcg ttc gge gcg ggt gcg gac ggc acg age ttc gee gag ggt gee ggt 1010 

^ Ala Phe Gly Ala Gly Ala Asp Gly Thr Ser Phe Ala Glu Gly Ala Gly 
rf 320 325 330 

gtg ctg ate gte gag agg etc tec gac gee gaa cgc aac ggt cac ace 1058 
Val Leu lie Val Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His Thr 
335 340 345 350 

gte ctg gcg gte gte egt ggt teg gcg gte aac cag gat ggt gee tec 1106 
Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser 
355 360 365 

aac ggg ctg teg gcg ccg aac ggg ccg teg cag gag egg gtg ate egg 1154 
Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gin Glu Arg Val lie Arg 
370 375 380 

cag gee ctg gee aac gee ggg etc acc ccg gcg gae gtg gac gee gte 1202 
Gin Ala Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp Val Asp Ala Val 
385 390 395 

gag gee cac gge ace ggc acc agg ctg ggc gac ccc ate gag gea cag 1250 
Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu Ala Gin 
400 405 410 
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gcg gta ctg gcc acc tac gga cag gag cgc gcc acc ccc ctg ctg ctg 12 98 

Ala Val Leu Ala Thr Tyr Gly Gin Glu Arg Ala Thr Pro Leu Leu Leu 
415 420 425 430 

ggc teg ctg aag tec aac ate ggc cac gcc cag gcc gcg tee ggc gtc 1346 
Gly Ser Leu Lys Ser Asn lie Gly His Ala Gin Ala Ala Ser Gly Val 
435 440 445 

gcc ggc ate ate aag atg gtg cag gcc etc egg cac ggg gag ctg ceg 1394 
Ala Gly lie lie Lys Met Val Gin Ala Leu Arg His Gly Glu Leu Pro 
450 455 460 

ceg acg ctg cac gcc gae gag ccg teg ccg cac gtc gac tgg acg gcc 1442 
Pro Thr Leu His Ala Asp Glu Pro Ser Pro His Val Asp Trp Thr Ala 
465 470 475 

ggc gcc gtc gaa ctg ctg acg teg gcc egg ccg tgg ccc gag acc gac 1490 
Gly Ala Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr Asp 
480 485 490 

egg cca egg cgt gcc gcc gtc tec teg ttc ggg gtg age ggc acc aac 153 8 
Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Val Ser Gly Thr Asn 
495 500 505 510 



gee cac gtc ate ctg gag gee gga ceg gta acg gag acg ccc gcg gea 1586 

ty Ala His Val lie Leu Glu Ala Gly Pro Val Thr Glu Thr Pro Ala Ala 

515 520 525 

m 

g teg cct tec ggt gac ett ccc ctg ctg gtg teg gea cgc tea ccg gaa 1634 

p Ser Pro Ser Gly Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro Glu 
5 530 535 540 

fit 

l"] gcg etc gac gag cag ate cgc ega ctg cgc gee tac ctg gac ace ace 1682 

^ Ala Leu Asp Glu Gin lie Arg Arg Leu Arg Ala Tyr Leu Asp Thr Thr 

Q 545 550 555 

ccg gae gtc gac egg gtg gcc gtg gea cag acg ctg gee egg cgc aca 173 0 
Pro Asp Val Asp Arg Val Ala Val Ala Gin Thr Leu Ala Arg Arg Thr 
560 565 570 

cac ttc gee cac cgc gee gtg ctg etc ggt gac acc gtc ate acc aca 1778 
His Phe Ala His Arg Ala Val Leu Leu Gly Asp Thr Val lie Thr Thr 
575 580 585 590 

ccc ccc gcg gae egg ccc gac gaa etc gtc ttc gtc tac tec ggc cag 1826 
Pro Pro Ala Asp Arg Pro Asp Glu Leu Val Phe Val Tyr Ser Gly Gin 
595 600 605 

ggc ace cag cat ccc gcg atg ggc gag cag eta gcc gat teg teg gtg 1874 
Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Asp Ser Ser Val 
610 615 620 



gtg ttc gcc gag egg atg gee gag tgt gcg gcg gcg ttg cgc gag ttc 
Val Phe Ala Glu Arg Met Ala Glu Cys Ala Ala Ala Leu Arg Glu Phe 
625 630 635 



1922 
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■5 



01 



gtg gac tgg gat ctg ttc acg gtt ctg gat gat ccg gcg gtg gtg gac 197G 
Val Asp Trp Asp Leu Phe Thr Val Leu Asp Asp Pro Ala Val Val Asp 
640 645 650 

egg gtt gat gtg gtc cag ccc get tec tgg geg atg atg gtt tee ctg 2018 
Arg Val Asp Val Val Gin Pro Ala Ser Trp Ala Met Met Val Ser Leu 
655 660 665 670 

gee gcg gtg tgg cag gcg gcc ggt gtg egg ccg gat gcg gtg ate gge 2066 
Ala Ala Val Trp Gin Ala Ala Gly Val Arg Pro Asp Ala Val lie Gly 
675 680 685 

eat teg cag ggt gag ate gcc gca get tgt gtg gcg ggt gcg gtg tea 2114 
His Ser Gin Gly Glu lie Ala Ala Ala Cys Val Ala Gly Ala Val Ser 
690 695 700 

eta cgc gat gcc gee egg ate gtg ace ttg egc age cag gcg ate gcc 2162 
Leu Arg Asp Ala Ala Arg lie Val Thr Leu Arg Ser Gin Ala lie Ala 
705 710 715 

egg ggc ctg gcg gge egg gge gcg atg gea tee gtc gcc ctg ccc gcg 2210 
Arg Gly Leu Ala Gly Arg Gly Ala Met Ala Ser Val Ala Leu Pro Ala 
720 725 730 

cag gat gtc gag ctg gtc gac ggg gee tgg ate gee gee cae aae ggg 2258 
Gin Asp Val Glu Leu Val Asp Gly Ala Trp lie Ala Ala His Asn Gly 
735 740 . . 745 750 



s CCC gee tee aee gtg ate gcg gge ace ccg gaa geg gtc gac eat gtc 2306 
p Pro Ala Ser Thr Val lie Ala Gly Thr Pro Glu Ala Val Asp His Val 
fll 755 760 765 

m 

%.\ gca eaa ggg gtg egg gtg egg egg ate ace gtc 2354 

^ Leu Thr Ala His Glu Ala Gin Gly Val Arg Val Arg Arg lie Thr Val 
'T: 770 775 780 

gac tat gee teg cae ace eeg cae gtc gag ctg ate egc gae gaa eta 24 02 
Asp Tyr Ala Ser His Thr Pro His Val Glu Leu lie Arg Asp Glu Leu 
785 790 795 

etc gac ate act age gae age age teg eag ace ccg etc gtg ccg tgg 2450 
Leu Asp lie Thr Ser Asp Ser Ser Ser Gin Thr Pro Leu Val Pro Trp 
800 805 810 

ctg teg ace gtg gae ggc ace tgg gtc gac age ccg ctg gac ggg gag 24 98 
Leu Ser Thr Val Asp Gly Thr Trp Val Asp Ser Pro Leu Asp Gly Glu 
815 820 825 830 

tac tgg tac egg aac ctg egt gaa eeg gtc ggt ttc cae ccc gee gtc 2546 
Tyr Trp Tyr Arg Asn Leu Arg Glu Pro Val Gly Phe His Pro Ala Val 
835 840 845 

age cag ttg cag gcc cag ggc gac acc gtg ttc gtc gag gtc age gee 2 5 94 
Ser Gin Leu Gin Ala Gin Gly Asp Thr Val Phe Val Glu Val Ser Ala 
850 855 860 
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age ccg gtg ttg ttg cag gcg atg gac gac gat gtc gtc acg gtt gcc 2642 
Ser Pro Val Leu Leu Gin Ala Met Asp Asp Asp Val Val Thr Val Ala 
865 870 875 

acg ctg cgt cgt gac gac ggc gac gcc acc egg atg etc ace gcc ctg 2 6 90 
Thr Leu Arg Arg Asp Asp Gly Asp Ala Thr Arg Met Leu Thr Ala Leu 
880 885 890 

gca cag gcc tat gtc cac ggc gtc acc gtc gac tgg cec gcc ate etc 2738 
Ala Gin Ala Tyr Val His Gly Val Thr Val Asp Trp Pro Ala lie Leu 
895 900 905 910 

ggc acc acc aca acc egg gta ctg gac ett ccg acc tac gcc ttc caa 2786 
Gly Thr Thr Thr Thr Arg Val Leu Asp Leu Pro Thr Tyr Ala Phe Gin 
915 920 925 

cac cag egg tac tgg etc gag teg gca cgc ccg gee gca tec gac gcg 2834 
His Gin Arg Tyr Trp Leu Glu Ser Ala Arg Pro Ala Ala Ser Asp Ala 
930 935 940 

ggc cac ccc gtg ctg ggc tec ggt ate gcc etc gcc ggg teg ccg ggc 2882 
Gly His Pro Val Leu Gly Ser Gly lie Ala Leu Ala Gly Ser Pro Gly 
J 945 950 955 

uJ egg gtg ttc acg ggt tec gtg ccg acc ggt gcg gac cgc gcg gtg ttc 2 93 0 
W Arg Val Phe Thr Gly Ser Val Pro Thr Gly Ala Asp Arg Ala Val Phe 
960 965 970 

•s# * - . ■ ■ 

B gtc gcc gag ctg gcg ctg gee gcc gcg gac gcg gtc gac tgc gcc acg 2 97 8 
Q Val Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Val Asp Cys Ala Thr 
ffH 975 980 985 990 

Py 

I^j gtc gag egg etc gac ate gcc tec gtg ccc ggc egg ccg ggc eat ggc 3 02 6 
Val Glu Arg Leu Asp lie Ala Ser Val Pro Gly Arg Pro Gly His Gly 
995 1000 1005 

egg acg acc gta cag acc tgg gtc gac gag ccg gcg gac gac ggc egg 3074 
Arg Thr Thr Val Gin Thr Trp Val Asp Glu Pro Ala Asp Asp Gly Arg 
1010 1015 1020 

cgc egg ttc ace gtg cac acc cgc acc ggc gac gee ccg tgg acg ctg 3122 
Arg Arg Phe Thr Val His Thr Arg Thr Gly Asp Ala Pro Trp Thr Leu 
1025 1030 1035 

cac gcc gag ggg gtg ctg cgc ccc cat ggc acg gcc ctg ccc gat gcg 3170 
His Ala Glu Gly Val Leu Arg Pro His Gly Thr Ala Leu Pro Asp Ala 
1040 1045 1050 

gcc gac gcc gag tgg ccc eca ccg ggc gcg gtg ccc gcg gac ggg ctg 3218 
Ala Asp Ala Glu Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly Leu 
1055 1060 1065 1070 

ccg ggt gtg tgg cgc egg ggg gac cag gtc ttc gcc gag gee gag gtg 3266 
Pro Gly Val Trp Arg Arg Gly Asp Gin Val Phe Ala Glu Ala Glu Val 
1075 1080 1085 
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gac gga ccg gac ggt ttc gtg gtg cac ccc gac ctg etc gac gcg gtc 3314 
Asp Gly Pro Asp Gly Phe Val Val His Pro Asp Leu Leu Asp Ala Val 
1090 1095 1100 

ttc tec gcg gtc ggc gac gga age cgc cag ccg gee gga tgg cgc gac 33 62 
Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Ala Gly Trp Arg Asp 
1105 1110 1115 

ctg aeg gtg cac gcg teg gac gee ace gta ctg cgc gee tgc etc ace 3410 
Leu Thr Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu Thr 
1120 1125 1130 

egg cgc acc gac gga gee atg gga ttc gee gee ttc gac ggc gee ggc 34 58 
Arg Arg Thr Asp Gly Ala Met Gly Phe Ala Ala Phe Asp Gly Ala Gly 
1135 1140 1145 1150 

ctg ccg gta etc ace gcg gag gcg gtg aeg ctg egg gag gtg gcg tea 3506 
Leu Pro Val Leu Thr Ala Glu Ala Val Thr Leu Arg Glu Val Ala Ser 
1155 1160 1165 

ccg tec ggc tee gag gag teg gac ggc ctg cac egg ttg gag tgg etc 3 554 
Pro Ser Gly Ser Glu Glu Ser Asp Gly Leu His Arg Leu Glu Trp Leu 
1170 1175 1180 



h,-^ gcg gtc gee gag gcg gtc tac gac ggt gac ctg ccc gag gga cat gtc 3 602 

yj Ala Val Ala Glu Ala Val Tyr Asp Gly Asp Leu Pro Glu Gly His Val 
H 1185 1190 1195 

01 . ■ 

= ctg ate acc gee gee cac . eec gac gac ccc gag gac ata ccc ace cgc 3650 

Q Leu lie Thr Ala Ala His Pro Asp Asp Pro Glu Asp lie Pro Thr Arg 
fQ 1200 1205 1210 
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gcc cac ace cgc gee acc cgc gtc ctg ace gee ctg caa cac eac etc 3698 
Ala His Thr Arg Ala Thr Arg Val Leu Thr Ala Leu Gin His His Leu 
1215 1220 1225 1230 

acc acc ace gac cac ace etc ate gtc cac acc acc ace gac ccc gcc 3746 
Thr Thr Thr Asp His Thr Leu lie Val His Thr Thr Thr Asp Pro Ala 
1235 1240 1245 

gge gcc ace gtc ace ggc etc acc cgc ace gee eag aac gaa eac eec 3794 
Gly Ala Thr Val Thr Gly Leu Thr Arg Thr Ala Gin Asn Glu His Pro 
1250 1255 1260 

cac cgc ate cgc etc ate gaa ace gac cac eec eac ace ccc etc cee 3842 
His Arg lie Arg Leu lie Glu Thr Asp His Pro His Thr Pro Leu Pro 
1265 1270 1275 

ctg gee caa etc gee ace etc gac eac eec eac etc ege etc acc eac 3890 
Leu Ala Gin Leu Ala Thr Leu Asp His Pro His Leu Arg Leu Thr His 
1280 1285 1290 

cac ace etc cac cae cee cac etc ace eec etc cac ace acc acc cea 3938 
His Thr Leu His His Pro His Leu Thr Pro Leu His Thr Thr Thr Pro 
1295 1300 1305 1310 
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ccc acc acc acc ccc etc aac ccc gaa cac gcc ate ate ate ace ggc 3 986 
Pro Thr Thr Thr Pro Leu Asn Pro Glu His Ala lie lie lie Thr Gly 
1315 1320 1325 

gge tec gge ace etc gee ggc ate etc gee ege cac ctg aac cac ccc 4034 
Gly Ser Gly Thr Leu Ala Gly lie Leu Ala Arg His Leu Asn His Pro 
1330 1335 1340 

cac ace tac etc etc tec cgc acc eca ccc ccc gac gcc acc ccc ggc 4082 
His Thr Tyr Leu Leu Ser Arg Thr Pro Pro Pro Asp Ala Thr Pro Gly 
1345 1350 1355 

ace cac etc ccc tgc gac gtc ggc gac ccc cac caa etc gcc ace ace 413 0 
Thr His Leu Pro Cys Asp Val Gly Asp Pro His Gin Leu Ala Thr Thr 
1360 1365 1370 

etc acc cac ate ccc caa ccc etc ace gcc ate tte cac ace gcc gcc 4178 
Leu Thr His lie Pro Gin Pro Leu Thr Ala lie Phe His Thr Ala Ala 
1375 1380 1385 1390 

=0 ^'^^ 9^*^ 4226 

^ Thr Leu Asp Asp Gly lie Leu His Ala Leu Thr Pro Asp Arg Leu Thr 
^ 1395 1400 1405 

ri 

acc gtc etc cac ccc aaa gee aac gee gee tgg cac ctg cac cac etc 4274 
Thr Val Leu His Pro Lys Ala Asn Ala Ala Trp His Leu His His Leu 
1410 . 1415 1420 

P ■ 

5^ ace caa aac caa ccc etc acc cac tte gtc etc tac tec age gee gee 4322 
p Thr Gin Asn Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala 
m 1425 1430 1435 

m 

9CC gtc etc ggc age ccc gga caa gga aac tac gee gcc gee aac gcc 4370 

J? Ala Val Leu Gly Ser Pro Gly Gin Gly Asn Tyr Ala Ala Ala Asn Ala 
^ 1440 1445 1450 

tte etc gac gee etc gcc acc cac cgc cac acc etc ggc caa ccc gee 4418 
Phe Leu Asp Ala Leu Ala Thr His Arg His Thr Leu Gly Gin Pro Ala 
1455 1460 1465 1470 

acc tec ate gcc tgg ggc atg tgg cac acc ace age ace etc acc gga 4466 
Thr Ser lie Ala Trp Gly Met Trp His Thr Thr Ser Thr Leu Thr Gly 
1475 1480 1485 

caa etc gac gac gcc gac egg gac cgc ate cgc cgc ggc ggt tte etc 4514 
Gin Leu Asp Asp Ala Asp Arg Asp Arg lie Arg Arg Gly Gly Phe Leu 
1490 1495 1500 

eeg ate aeg gac gac gag ggc atggggatgc at 4547 
Pro lie Thr Asp Asp Glu Gly 
1505 



<210> 23 
<211> 1509 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 23 

Gin Leu Ala Glu Ala Leu Leu Thr Leu Val Arg Glu Ser Thr Ala Ala 
1.5. 10 15 

Val Leu Gly His Val Gly Gly Glu Asp He Pro Ala Thr Ala Ala Phe 
20 25 30 

Lys Asp Leu Gly He Asp Ser Leu Thr Ala Val Gin Leu Arg Asn Ala 
35 40 45 

Leu Thr Glu Ala Thr Gly Val Arg Leu Asn Ala Thr Ala Val Phe Asp 
50 55 60 

Phe Pro Thr Pro His Val Leu Ala Gly Lys Leu Gly Asp Glu Leu Thr 
65 70 75 80 

Gly Thr Arg Ala Pro Val Val Pro Arg Thr Ala Ala Thr Ala Gly Ala 
85 90 95 

His Asp Glu Pro Leu Ala He Val Gly Met Ala Cys Arg Leu Pro Gly 
100 105 110 

Gly Val Ala Ser Pro Glu Glu Leu Trp His Leu Val Ala Ser Gly Thr 
115 120 125 

Asp Ala He Thr Glu Phe Pro Thr Asp Arg Gly Trp Asp Val Asp Ala 
130 135 140 

He Tyr Asp Pro Asp Pro Asp Ala He Gly Lys Thr Phe Val Arg His 
145 150 155 160 

Gly Gly Phe Leu Thr Gly Ala Thr Gly Phe Asp Ala Ala Phe Phe Gly 
165 170 175 

He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Val Leu 
180 185 190 

Leu Glu Thr Ser Trp Glu Ala Phe Glu Ser Ala Gly He Thr Pro Asp 
195 200 205 

Ser Thr Arg Gly Ser Asp Thr Gly Val Phe Val Gly Ala Phe Ser Tyr 
210 215 220 

Gly Tyr Gly Thr Gly Ala Asp Thr Asp Gly Phe Gly Ala Thr Gly Ser 
225 230 235 240 

Gin Thr Ser Val Leu Ser Gly Arg Leu Ser Tyr Phe Tyr Gly Leu Glu 
245 250 255 

Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 
260 265 270 
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Leu His Gin Ala Gly Gin Ser Leu Arg Ser Gly Glu Cys Ser Leu Ala 
275 280 285 

Leu Val Gly Gly Val Thr Val Met Ala Ser Pro Gly Gly Phe Val Glu 
290 295 300 

Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Ala Lys Ala Phe 
305 310 315 320 

Gly Ala Gly Ala Asp Gly Thr Ser Phe Ala Glu Gly Ala Gly Val Leu 
325 330 335 

lie Val Glu Arg Leu Ser Asp Ala Glu Arg Asn Gly His Thr Val Leu 
340 345 350 

Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
355 360 365 

Leu Ser Ala Pro Asn Gly Pro Ser Gin Glu Arg Val lie Arg Gin Ala 
p 370 375 380 

Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp Val Asp Ala Val Glu Ala 
J 385 390 395 400 

p His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Val 
lU 4 05 410 415 

51 Leu Ala Thr Tyr Gly Gin Glu Arg Ala Thr Pro Leu Leu Leu Gly Ser 

5 420 425 . . 430 

ff^ Leu Lys Ser Asn lie Gly His Ala Gin Ala Ala Ser Gly Val Ala Gly 



435 440 445 

lie lie Lys Met Val Gin Ala Leu Arg His Gly Glu Leu Pro Pro Thr 
450 455 460 

Leu His Ala Asp Glu Pro Ser Pro His Val Asp Trp Thr Ala Gly Ala 
465 470 475 480 

Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr Asp Arg Pro 
485 490 495 

Arg Arg Ala Ala Val Ser Ser Phe Gly Val Set Gly Thr Asn Ala His 
500 505 510 

Val lie Leu Glu Ala Gly Pro Val Thr Glu Thr Pro Ala Ala Ser Pro 
515 520 525 

Ser Gly Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro Glu Ala Leu 
530 535 540 

Asp Glu Gin lie Arg Arg Leu Arg Ala Tyr Leu Asp Thr Thr Pro Asp 
545 550 555 560 

Val Asp Arg Val Ala Val Ala Gin Thr Leu Ala Arg Arg Thr His Phe 
565 570 575 
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Ala His Arg Ala Val Leu Leu Gly Asp Thr Val He Thr Thr Pro Pro 
580 585 590 

Ala Asp Arg Pro Asp Glu Leu Val Phe Val Tyr Ser Gly Gin Gly Thr 
595 600 605 

Gin His Pro Ala Met Gly Glu Gin Leu Ala Asp Ser Ser Val Val Phe 
610 615 620 

Ala Glu Arg Met Ala Glu Cys Ala Ala Ala Leu Arg Glu Phe Val Asp 
625 630 635 640 

Trp Asp Leu Phe Thr Val Leu Asp Asp Pro Ala Val Val Asp Arg Val 
645 650 655 

Asp Val Val Gin Pro Ala Ser Trp Ala Met Met Val Ser Leu Ala Ala 
660 665 670 

Val Trp Gin Ala Ala Gly Val Arg Pro Asp Ala Val He Gly His Ser 
675 680 685 

Gin Gly Glu He Ala Ala Ala Cys Val Ala Gly Ala Val Ser Leu Arg 
690 695 700 

Asp Ala Ala Arg He Val Thr Leu Arg Ser Gin Ala He Ala Arg Gly 
705 710 715 720 

Leu Ala Gly Arg Gly Ala Met Ala Ser. Val Ala Leu Pro Ala Gin Asp 
725 730 735 

Val Glu Leu Val Asp Gly Ala Trp He Ala Ala His Asn Gly Pro Ala 
740 745 750 

Ser Thr Val He Ala Gly Thr Pro Glu Ala Val Asp His Val Leu Thr 
755 760 765 

Ala His Glu Ala Gin Gly Val Arg Val Arg Arg He Thr Val Asp Tyr 
770 775 780 

Ala Ser His Thr Pro His Val Glu Leu He Arg Asp Glu Leu Leu Asp 
785 790 795 • 800 

He Thr Ser Asp Ser Ser Ser Gin Thr Pro Leu Val Pro Trp Leu Ser 
805 810 815 

Thr Val Asp Gly Thr Trp Val Asp Ser Pro Leu Asp Gly Glu Tyr Trp 
820 825 830 

Tyr Arg Asn Leu Arg Glu Pro Val Gly Phe His Pro Ala Val Ser Gin 
835 840 845 

Leu Gin Ala Gin Gly Asp Thr Val Phe Val Glu Val Ser Ala Ser Pro 
850 855 860 



Val Leu Leu Gin Ala Met Asp Asp Asp Val Val Thr Val Ala Thr Leu 
865 870 875 880 
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Arg Arg Asp Asp Gly Asp Ala Thr Arg Met Leu Thr Ala Leu Ala Gin 
885 890 895 

Ala Tyr Val His Gly Val Thr Val Asp Trp Pro Ala He Leu Gly Thr 
900 905 910 

Thr Thr Thr Arg Val Leu Asp Leu Pro Thr Tyr Ala Phe Gin His Gin 
915 920 925 

Arg Tyr Trp Leu Glu Ser Ala Arg Pro Ala Ala Ser Asp Ala Gly His 
930 935 940 

Pro Val Leu Gly Ser Gly He Ala Leu Ala Gly Ser Pro Gly Arg Val 
945 950 955 960 

Phe Thr Gly Ser Val Pro Thr Gly Ala Asp Arg Ala Val Phe Val Ala 
965 970 975 

Glu Leu Ala Leu Ala Ala Ala Asp Ala Val Asp Cys Ala Thr Val Glu 
□ 980 985 990 

,Q Arg Leu Asp He Ala Ser Val Pro Gly Arg Pro Gly His Gly Arg Thr 
995 1000 1005 

=^ 

ff, Thr Val Gin Thr Trp Val Asp Glu Pro Ala Asp Asp Gly Arg Arg Arg 
P 1010 1015 1020 

Phe Thr Val His Thr Arg Thr Gly Asp Ala Pro Trp Thr Leu His Ala 
025 1030 1035 1040 

ffi Glu Gly Val Leu Arg Pro His Gly Thr Ala Leu Pro Asp Ala Ala Asp 
pj 1045 1050 1055 

Ala Glu Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly Leu Pro Gly 
1060 1065 1070 

Val Trp Arg Arg Gly Asp Gin Val Phe Ala Glu Ala Glu Val Asp Gly 
1075 1080 1085 

Pro Asp Gly Phe Val Val His Pro Asp Leu Leu Asp Ala Val Phe Ser 
1090 1095 1100 

Ala Val Gly Asp Gly Ser Arg Gin Pro Ala Gly Trp Arg Asp Leu Thr 
105 1110 1115 1120 

Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu Thr Arg Arg 
1125 1130 1135 

Thr Asp Gly Ala Met Gly Phe Ala Ala Phe Asp Gly Ala Gly Leu Pro 
1140 1145 1150 

Val Leu Thr Ala Glu Ala Val Thr Leu Arg Glu Val Ala Ser Pro Ser 
1155 1160 1165 

Gly Ser Glu Glu Ser Asp Gly Leu His Arg Leu Glu Trp Leu Ala Val 
1170 1175 1180 



a 
m 
m 
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Ala Glu Ala Val Tyr Asp Gly Asp Leu Pro Glu Gly His Val Leu lie 
185 1190 1195 1200 

Thr Ala Ala His Pro Asp Asp Pro Glu Asp lie Pro Thr Arg Ala His 
1205 1210 1215 

Thr Arg Ala Thr Arg Val Leu Thr Ala Leu Gin His His Leu Thr Thr 
1220 1225 1230 

Thr Asp His Thr Leu lie Val His Thr Thr Thr Asp Pro Ala Gly Ala 
1235 1240 1245 

Thr Val Thr Gly Leu Thr Arg Thr Ala Gin Asn Glu His Pro His Arg 
1250 1255 1260 

lie Arg Leu lie Glu Thr Asp His Pro His Thr Pro Leu Pro Leu Ala 
265 1270 1275 1280 

Gin Leu Ala Thr Leu Asp His Pro His Leu Arg Leu Thr His His Thr 
1285 1290 1295 

Leu His His Pro His Leu Thr Pro Leu His Thr Thr Thr Pro Pro Thr 
J 1300 1305 1310 

^ Thr Thr Pro Leu Asn Pro Glu His Ala lie lie lie Thr Gly Gly Ser 
O 1315 1320 1325 

w 

M» Gly Thr Leu Ala Gly He Leu Ala Arg His Leu Asn His Pro His Thr 
31 1330 1335 1340 

Q Tyr Leu Leu Ser Arg Thr Pro Pro Pro Asp Ala Thr Pro Gly Thr His 

ff2 -^^^ 1355 1360 

l"! Asp Val Gly Asp Pro His Gin Leu Ala Thr Thr Leu Thr 

I? 1365 1370 1375 

M. His He Pro Gin Pro Leu Thr Ala He Phe His Thr Ala Ala Thr Leu 
1380 1385 1390 

Asp Asp Gly He Leu His Ala Leu Thr Pro Asp Arg Leu Thr Thr Val 
1395 1400 1405 

Leu His Pro Lys Ala Asn Ala Ala Trp His Leu His His Leu Thr Gin 
1410 1415 1420 

Asn Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala Ala Val 
425 1430 1435 1440 

Leu Gly Ser Pro Gly Gin Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu 
1445 1450 1455 

Asp Ala Leu Ala Thr His Arg His Thr Leu Gly Gin Pro Ala Thr Ser 
1460 1465 1470 

He Ala Trp Gly Met Trp His Thr Thr Ser Thr Leu Thr Gly Gin Leu 
1475 1480 1485 
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n 



Asp Asp Ala Asp Arg Asp Arg lie Arg Arg Gly Gly Phe Leu Pro lie 
1490 1495 1500 

Thr Asp Asp Glu Gly 
505 



<210> 24 
<211> 4725 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (3) . . (4724) 

<400> 24 

gc atg egg ctg tac gag gcg gca egg cgc acc gga agt ccc gtg gtg 4 7 

Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val 
15 10 15 

^ gtg gcg gcc gcg etc gac gac gcg ccg gac gtg ccg ctg ctg cgc ggg 95 

Val Ala Ala Ala Leu Asp Asp Ala Pro Asp Val Pro Leu Leu Arg Gly 
gi 20 25 30 

g ctg egg cgt acg acc gtc egg egt gcc gee gtc egg gaa cgc tct etc 14 3 
P3 Leu Arg Arg Thr Thr Val Arg Arg Ala Ala Val Arg Glu Arg Ser Leu 

35 . 40 45 

^ 9CC gac cgc teg ccg tgc tge ccg acg acg age gcg ccg acg cct ccc 191 
f-^ Ala Asp Arg Ser Pro Cys Cys Pro Thr Thr Ser Ala Pro Thr Pro Pro 
50 55 60 

teg cgt teg tec tgg aac age acc gee ace gtg etc ggc eac ctg ggc 239 
Ser Arg Ser Ser Trp Asn Ser Thr Ala Thr Val Leu Gly His Leu Gly 
65 70 75 

gcc gaa gac ate ccg gcg acg acg acg ttc aag gaa etc ggc ate gac 2 87 
Ala Glu Asp lie Pro Ala Thr Thr Thr Phe Lys Glu Leu Gly lie Asp 
80 85 90 95 

teg etc ace gcg gtc eag ctg cgc aac gcg ctg ace acg gcg ace ggc 335 
Ser Leu Thr Ala Val Gin Leu Arg Asn Ala Leu Thr Thr Ala Thr Gly 
100 105 110 

gta cgc etc aac gcc aca gcg gtc ttc gac ttt ccg acg ccg cgc gcg 3 83 
Val Arg Leu Asn Ala Thr Ala Val Phe Asp Phe Pro Thr Pro Arg Ala 
115 120 125 

etc gee gcg aga etc ggc gac gag ctg gcc ggt acc cgc gcg ccc gtc 431 
Leu Ala Ala Arg Leu Gly Asp Glu Leu Ala Gly Thr Arg Ala Pro Val 
130 135 140 
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gcg gcc egg acc gcg gcc acc gcg gcc gcg cac gac gaa ccg ctg gcg 
Ala Ala Arg Thr Ala Ala Thr Ala Ala Ala His Asp Glu Pro Leu Ala 
145 150 155 

ate gtg gge atg gee tgc cgt ctg ccg gge ggg gtc gcg teg eea eag 
lie Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Ala Ser Pro Gin 
160 165 170 175 

gag ctg tgg cgt etc gtc gcg tec gge acc gac gcc ate aeg gag ttc 
Glu Leu Trp Arg Leu Val Ala Ser Gly Thr Asp Ala lie Thr Glu Phe 
180 185 190 

cce gcg gac cgc gge tgg gac gtg gac gcg etc tac gac ccg gac cec 
Pro Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro 
195 200 205 

gac gcg ate gge aag acc ttc gtc egg cac gge gge ttc etc gac ggt 
Asp Ala lie Gly Lys Thr Phe Val Arg His Gly Gly Phe Leu Asp Gly 
210 215 220 

gcg ace gge ttc gac gcg gcg ttc ttc ggg ate age ccg cgc gag gee 
Ala Thr Gly Phe Asp Ala Ala Phe Phe Gly lie Ser Pro Arg Glu Ala 
225 230 235 

ctg gcc atg gac ccg eag caa egg gtg etc ctg gag aeg tec tgg gag 
Leu Ala Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser Trp Glu 
240 245 250 255 

gcg ttc gaa age gcg gge ate ace ccg gac gcg gcg egg. gge age gac 
Ala Phe Glu Ser Ala Gly lie Thr Pro Asp Ala Ala Arg Gly Ser Asp 
260 265 270 

ace gge gtg ttc ate gge gcg ttc tec tac ggg tac gge aeg ggt gcg 
Thr Gly Val Phe lie Gly Ala Phe Ser Tyr Gly Tyr Gly Thr Gly Ala 
275 280 285 

gat ace aac gge ttc gge gcg aca ggg teg eag ace age gtg etc tec 
Asp Thr Asn Gly Phe Gly Ala Thr Gly Ser Gin Thr Ser Val Leu Ser 
290 295 300 

gge cgc etc teg tac ttc tac ggt ctg gag gge cet teg gtc aeg gtc 
Gly Arg Leu Ser Tyr Phe Tyr Gly Leu Glu Gly Pro Ser Val Thr Val 
305 310 315 

gac ace gee tgc teg teg tea ctg gtc gee ctg cac eag gea ggg eag 
Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin 
320 325 330 335 

tec ctg cgc teg gge gaa tgc teg etc gcc ctg gtc gge ggt gtc aeg 
Ser Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr 
340 345 350 



gtg atg gcg teg cce gge gga ttc gtc gag ttc tec egg eag cgc ggg 
Val Met Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly 
355 360 365 
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etc gcg ccg gac ggg egg geg aag gcg tte gge geg ggc gcg gac ggt 1151 
Leu Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly 
370 375 380 

acg age tte gee gag ggc gee ggt gee etg gtg gtc gag egg etc tee 1199 
Thr Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser 
385 390 395 

gac gcg gag ege eac gge cac acc gtc etc gee etc gta cgc ggc tec 1247 
Asp Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser 
400 405 410 415 

gcg get aac tec gac ggc gcg teg aae ggt etg teg geg ceg aac gge 1295 
Ala Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly 
420 425 430 

ecc tec eag gaa ege gtc ate cac cag gee etc geg aae gcg aaa etc 1343 
Pro Ser Gin Glu Arg Val He His Gin Ala Leu Ala Asn Ala Lys Leu 
435 440 445 

^ acc ecc gee gat gtc gac geg gtc gag geg cac ggc acc gge ace ege 13 91 

Thr Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg 

'^■'^ 450 455 460 

£ 

Q etc gge gac ecc ate gag gcg eag gcg etg etc geg acg tac gga cag 14 3 9 

Uj Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin 

465 470 475 

Oi . ■ ■ 

„ gac egg geg acg ecc etg etg etc ggc teg . etg aag teg aae ate ggg 1487 

Asp Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly 
480 485 490 495 

cac gee eag gee geg tea ggg gtc gee ggg ate ate aag atg gtg eag 153 5 
-4 His Ala Gin Ala Ala Ser Gly Val Ala Gly He He Lys Met Val Gin 
O 500 505 510 

gee ate egg eac ggg gaa etg ccg ceg aea etg cac geg gac gag ccg 1583 
Ala He Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro 
515 520 525 

teg ccg cac gtc gac tgg acg gee ggt gee gtc gag etc etg acg teg 1631 
Ser Pro His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser 
530 535 540 

gee egg ccg tgg ccg ggg ace ggt cgc ccg ege ege get gee gtc teg 1679 
Ala Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Ala Val Ser 
545 550 555 

teg tte ggc gtg age gge acg aac gee eac ate ate ett gag gea gga 1727 
Ser Phe Gly Val Ser Gly Thr Asn Ala His He He Leu Glu Ala Gly 
560 565 570 575 

ccg gtc aaa acg gga ccg gtc gag gea gga geg ate gag gea gga ccg 1775 
Pro Val Lys Thr Gly Pro Val Glu Ala Gly Ala He Glu Ala Gly Pro 
580 585 590 
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gtc gaa gta gga ccg gtc gag get gga ccg etc cce gcg gcg ccg ccg 1823 
Val Glu Val Gly Pro Val Glu Ala Gly Pro Leu Pro Ala Ala Pro Pro 
595 600 605 

tea gca ccg ggc gaa gac ctt ccg ctg etc gtg teg gcg cgt tec ccg 1871 
Ser Ala Pro Gly Glu Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro 
610 615 620 

gag gca etc gac gag cag ate ggg egc ctg cge gee tat etc gac ace 1919 
Glu Ala Leu Asp Glu Gin lie Gly Arg Leu Arg Ala Tyr Leu Asp Thr 
625 630 635 

ggc ccg gge gtc gac egg gcg gee gtg gcg eag aca ctg gee egg cgt 1967 
Gly Pro Gly Val Asp Arg Ala Ala Val Ala Gin Thr Leu Ala Arg Arg 
640 645 650 655 

aeg eac ttc ace cac egg gee gta ctg etc ggg gac ace gtc ate gge 2015 
Thr His Phe Thr His Arg Ala Val Leu Leu Gly Asp Thr Val lie Gly 
660 665 670 

get cce cec gcg gac cag gee gac gaa etc gtc ttc gtc tac tec ggt 2063 
Ala Pro Pro Ala Asp Gin Ala Asp Glu Leu Val Phe Val Tyr Ser Gly 
675 680 685 

cag ggc ace cag cat cce gcg atg ggc gag caa etc gcg gee gcg ttc 2111 
Gin Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Ala Ala Phe 
690 695 700 

Si ■ ■ . ■ . 

s cce gtg ttc gee gat gee tgg cae gac gcg etc ega egg etc gac gac 2159 
Q Pro Val Phe Ala Asp Ala Trp His Asp Ala Leu Arg Arg Leu Asp Asp 
33 705 710 715 

ftj 

cce gae ccg cac gac cec aca egg age eag cae aeg etc ttc gee eac 22 07 

^ Pro Asp Pro His Asp Pro Thr Arg Ser Gin His Thr Leu Phe Ala His 
^ 720 725 730 735 

eag gcg gcg ttc ace gee etc ctg agg tee tgg gae ate aeg ccg eac 2255 
Gin Ala Ala Phe Thr Ala Leu Leu Arg Ser Trp Asp lie Thr Pro His 
740 745 750 

gee gtc ate ggc cae teg etc ggc gag ate ace gee gcg tac gee gee 2303 
Ala Val lie Gly His Ser Leu Gly Glu lie Thr Ala Ala Tyr Ala Ala 
755 760 765 

ggg ate ctg teg etc gac gac gee tge ace ctg ate ace aeg cgt gee 2351 
Gly lie Leu Ser Leu Asp Asp Ala Cys Thr Leu lie Thr Thr Arg Ala 
770 775 780 

egc etc atg cae aeg ctt ccg ccg cec gge gee atg gtc ace gtg ctg 2399 
Arg Leu Met His Thr Leu Pro Pro Pro Gly Ala Met Val Thr Val Leu 
785 790 795 

aec age gag gag gag gee cgt cag gcg ctg egg ccg gge gtg gag ate 2447 
Thr Ser Glu Glu Glu Ala Arg Gin Ala Leu Arg Pro Gly Val Glu lie 
800 805 810 815 
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SI. 



gcc gcg gtc ttc ggc ccg cac tec gtc gtg etc teg ggc gac gag gac 2495 
Ala Ala Val Phe Gly Pro His Ser Val Val Leu Ser Gly Asp Glu Asp 
820 825 830 

gcc gtg etc gac gtc gca cag egg etc ggc ate cac cac egt ctg cec 2543 
Ala Val Leu Asp Val Ala Gin Arg Leu Gly lie His His Arg Leu Pro 
835 840 845 

gcg ccg cac gcg ggc cac tec gcg cac atg gaa cec gtg gee gcc gag 25 91 
Ala Pro His Ala Gly His Ser Ala His Met Glu Pro Val Ala Ala Glu 
850 855 860 

ctg etc gee ace act cgc gag etc egt tac gac egg cec cac ace gcc 2639 
Leu Leu Ala Thr Thr Arg Glu Leu Arg Tyr Asp Arg Pro His Thr Ala 
865 870 875 

ate ccg aac gac cec acc acc gee gag tac tgg gcc gag cag gtc cgc 2687 
lie Pro Asn Asp Pro Thr Thr Ala Glu Tyr Trp Ala Glu Gin Val Arg 
880 885 890 895 

aac cec gtg ctg ttc cac gee cac ace cag egg tac cec gac gcc gtg 2735 
Asn Pro Val Leu Phe His Ala His Thr Gin Arg Tyr Pro Asp Ala Val 
900 905 910 

ttc gtc gag ate ggc cec ggc cag gac etc tea ccg ctg gtc gac ggc 2783 
Phe Val Glu lie Gly Pro Gly Gin Asp Leu Ser Pro Leu Val Asp Gly 
915 920 925 

ate gcc ctg cag aac ggc aeg gcg gac gag gtg cac gcg ctg cac ace 2831 
lie Ala Leu Gin Asn Gly Thr Ala Asp Glu Val His Ala Leu His Thr 
930 935 940 



gcg etc gcc cgc etc ttc aca cgc ggc gee aeg etc gac tgg tec cgc 2879 
Ala Leu Ala Arg Leu Phe Thr Arg Gly Ala Thr Leu Asp Trp Ser Arg 
O 945 950 955 

ate etc ggc ggt get teg egg cac gac cet gac gtc cec teg tac gcg 2927 
lie Leu Gly Gly Ala Ser Arg His Asp Pro Asp Val Pro Ser Tyr Ala 
960 965 970 975 

ttc cag egg egt cec tac tgg ate gag teg get cec ccg gee aeg gcc 2 975 
Phe Gin Arg Arg Pro Tyr Trp lie Glu Ser Ala Pro Pro Ala Thr Ala 
980 985 990 

gac teg ggc cac cec gtc etc ggc acc gga gtc gcc gtc gcc ggg teg 3023 
Asp Ser Gly His Pro Val Leu Gly Thr Gly Val Ala Val Ala Gly Ser 
995 1000 1005 

ccg ggc egg gtg ttc aeg ggt cec gtg cec gcc ggt gcg gac cgc gcg 3071 
Pro Gly Arg Val Phe Thr Gly Pro Val Pro Ala Gly Ala Asp Arg Ala 
1010 1015 1020 

gtg ttc ate gcc gaa ctg gcg etc gcc gee gcc gac gcc ace gac tge 3119 
Val Phe lie Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Thr Asp Cys 
1025 1030 1035 
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gcc acg gtc gaa cag etc gac gtc acc tec gtg ccc gge gga tec gee 3167 
Ala Thr Val Glu Gin Leu Asp Val Thr Ser Val Pro Gly Gly Ser Ala 
1040 1045 1050 1055 

cgc ggc agg gcc acc gcg cag acc tgg gtc gat gaa ccc gcc gcc gac 3215 
Arg Gly Arg Ala Thr Ala Gin Thr Trp Val Asp Glu Pro Ala Ala Asp 
1060 1065 1070 

SS9 egg cgc cgc ttc acc gtc cac acc cgc gtc ggc gac gcc ccg tgg 3263 
Gly Arg Arg Arg Phe Thr Val His Thr Arg Val Gly Asp Ala Pro Trp 
1075 1080 1085 

acg etg cac gcc gag ggg gtt etc cgc ccc gge cgc gtg ccc cag ccc 3 311 
Thr Leu His Ala Glu Gly Val Leu Arg Pro Gly Arg Val Pro Gin Pro 
1090 1095 1100 

gaa gcc gtc gac ace gcc tgg ccc ccg ccg ggc gcg gtg ccc gcg gac 3359 
Glu Ala Val Asp Thr Ala Trp Pro Pro Pro Gly Ala Val Pro Ala Asp 
1105 1110 1115 

ggg etg ccc ggg gcg tgg cga cgc gcg gac cag gtc ttc gtc gaa gee 3407 
Gly Leu Pro Gly Ala Trp Arg Arg Ala Asp Gin Val Phe Val Glu Ala 
1120 1125 1130 1135 



01 



m 



gaa gtc gac age cct gac ggc ttc gtg gea cac ccc gac etg etc gac 3455 
Glu Val Asp Ser Pro Asp Gly Phe Val Ala His Pro Asp Leu Leu Asp 
1140 1145 1150 

gcg gtc ttc tec gcg gtc ggc gac ggg age cgc cag ccg acc gga tgg 3503 
Ala Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Thr Gly Trp 
1155 1160 1165 



cgc gac etc gcg gtg cac gcg teg gac gcc ace gtg etg cgc gcc tgc 3551 
4 Arg Asp Leu Ala Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys 
1170 1175 1180 

etc acc cgc cgc gac agt ggt gtc gtg gag etc gee gcc ttc gac ggt 3599 
Leu Thr Arg Arg Asp Ser Gly Val Val Glu Leu Ala Ala Phe Asp Gly 
1185 1190 1195 

gcc gga atg ccg gtg etc ace gcg gag teg gtg acg etg ggc gag gtc 3647 
Ala Gly Met Pro Val Leu Thr Ala Glu Ser Val Thr Leu Gly Glu Val 
1200 1205 1210 1215 

gcg teg gea ggc gga tec gac gag teg gac ggt etg ett egg ett gag 3695 
Ala Ser Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu Arg Leu Glu 
1220 1225 1230 

tgg ttg ccg gtg gcg gag gcc cac tac gac ggt gcc gac gag etg ccc 3743 
Trp Leu Pro Val Ala Glu Ala His Tyr Asp Gly Ala Asp Glu Leu Pro 
1235 1240 1245 

gag ggc tac acc etc ate acc gcc aca cac ccc gac gac ccc gac gac 3791 
Glu Gly Tyr Thr Leu He Thr Ala Thr His Pro Asp Asp Pro Asp Asp 
1250 1255 1260 
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ccc acc aac ccc cac aac aca ccc aca cgc acc cac aca caa acc aca 3839 
Pro Thr Asn Pro His Asn Thr Pro Thr Arg Thr His Thr Gin Thr Thr 
1265 1270 1275 

cgc gtc etc acc gcc etc caa cac cac etc ate acc acc aac cac acc 3887 
Arg Val Leu Thr Ala Leu Gin His His Leu lie Thr Thr Asn His Thr 
1280 1285 1290 1295 

etc ate gtc cac acc acc ace gac ccc cea ggc gcc gcc gtc acc ggc 3935 
Leu lie Val His Thr Thr Thr Asp Pro Pro Gly Ala Ala Val Thr Gly 
1300 1305 1310 

etc acc cgc acc gca caa aac gaa cac ccc ggc cgc ate cac etc ate 3 983 
Leu Thr Arg Thr Ala Gin Asn Glu His Pro Gly Arg lie His Leu lie 
1315 1320 1325 

gaa ace cac cac ccc cac acc cea etc ccc etc acc caa etc acc acc 4031 
Glu Thr His His Pro His Thr Pro Leu Pro Leu Thr Gin Leu Thr Thr 
1330 1335 1340 

^ etc cac caa ccc cac eta cgc etc acc aac aac acc etc cac acc ccc 4079 

Leu His Gin Pro His Leu Arg Leu Thr Asn Asn Thr Leu His Thr Pro 
^ 1345 1350 1355 

O cac etc ace ccc ate ace ace cac cac aac acc acc aca acc acc ccc 4127 

y His Leu Thr Pro lie Thr Thr His His Asn Thr Thr Thr Thr Thr Pro 
M, 1360 1365 1370 1375 

aac acc cea ccc etc aac ccc aac cac gcc ate etc ate acc ggc ggc 4175 
i«i Asn Thr Pro Pro Leu Asn Pro Asn His Ala lie Leu lie Thr Gly Gly 
'% 1380 1385 1390 

JM tec ggc acc etc gcc ggc ate etc gcc cgc cac etc aac cac ccc cac 4223 

Ser Gly Thr Leu Ala Gly lie Leu Ala Arg His Leu Asn His Pro His 
O 1395 1400 1405 

acc tac etc etc tec cgc aca cea eca ccc ccc acc aca ccc ggc acc 4271 
Thr Tyr Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr Pro Gly Thr 
1410 1415 1420 

cac ate ccc tgc gac etc acc gac ccc acc caa ate acc caa gee etc 4319 
His lie Pro Cys Asp Leu Thr Asp Pro Thr Gin lie Thr Gin Ala Leu 
1425 1430 1435 

acc cac ata cea caa ccc etc acc ggc ate ttc cac acc gee gee ace 4367 
Thr His lie Pro Gin Pro Leu Thr Gly Ile/Phe His Thr Ala Ala Thr 
1440 1445 1450 1455 

etc gac gac gcc ace etc acc aac etc acc ccc caa cac etc acc ace 4415 
Leu Asp Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin His Leu Thr Thr 
1460 1465 1470 

acc etc caa ccc aaa gee gac gcc gcc tgg cac etc cac cac cac acc 4463 
Thr Leu Gin Pro Lys Ala Asp Ala Ala Trp His Leu His His His Thr 
1475 1480 1485 
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caa aac caa ccc etc acc cac ttc gtc etc tac tec age gee gee gee 4511 
Gin Asn Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala Ala 
1490 1495 1500 

ace etc ggc age ccc ggc caa gee aac tac gee gee gee aac gee ttc 4559 
Thr Leu Gly Ser Pro Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Phe 
1505 1510 1515 

etc gac gee etc gee acc cac cgc cac ace caa gga caa ccc gee acc 4607 
Leu Asp Ala Leu Ala Thr His Arg His Thr Gin Gly Gin Pro Ala Thr 
1520 1525 1530 1535 

acc ate gee tgg ggc atg tgg cac ace acc ace aca etc ace age caa 4655 
Thr He Ala Trp Gly Met Trp His Thr Thr Thr Thr Leu Thr Ser Gin 
1540 1545 1550 

etc acc gac age gac cgc gac cgc ate cgc cgc ggc ggc ttc etg ccg 4 7 03 
Leu Thr Asp Ser Asp Arg Asp Arg lie Arg Arg Gly Gly Phe Leu Pro 
1555 1560 1565 

ate teg gac gac gag ggc atg c 4725 
He Ser Asp Asp Glu Gly Met 
1570 



<210> 25 
<211> 1574 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 



O <400> 25 

H= Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val Val 
15 10 15 

Ala Ala Ala Leu Asp Asp Ala Pro Asp Val Pro Leu Leu Arg Gly Leu 
20 25 30 

Arg Arg Thr Thr Val Arg Arg Ala Ala Val Arg Glu Arg Ser Leu Ala 
35 40 45 

Asp Arg Ser Pro Cys Cys Pro Thr Thr Ser Ala Pro Thr Pro Pro Ser 
50 55 60 

Arg Ser Ser Trp Asn Ser Thr Ala Thr Val Leu Gly His Leu Gly Ala 
65 70 75 80 

Glu Asp lie Pro Ala Thr Thr Thr Phe Lys Glu Leu Gly He Asp Ser 
85 90 95 

Leu Thr Ala Val Gin Leu Arg Asn Ala Leu Thr Thr Ala Thr Gly Val 
100 105 110 
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Arg Leu Asn Ala Thr Ala Val Phe Asp Phe Pro Thr Pro Arg Ala Leu 
115 120 125 

Ala Ala Arg Leu Gly Asp Glu Leu Ala Gly Thr Arg Ala Pro Val Ala 
130 135 140 

Ala Arg Thr Ala Ala Thr Ala Ala Ala His Asp Glu Pro Leu Ala lie 
145 150 155 160 

Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Ala Ser Pro Gin Glu 
165 170 175 

Leu Trp Arg Leu Val Ala Ser Gly Thr Asp Ala lie Thr Glu Phe Pro 
180 185 190 

Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro Asp 
195 200 205 

Ala lie Gly Lys Thr Phe Val Arg His Gly Gly Phe Leu Asp Gly Ala 
210 215 220 

Thr Gly Phe Asp Ala Ala Phe Phe Gly lie Ser Pro Arg Glu Ala Leu 
225 230 235 240 

Ala Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser Trp Glu Ala 
245 250 255 

Phe Glu Ser Ala Gly lie Thr Pro Asp Ala Ala Arg Gly Ser Asp Thr 
260 265 270 

Gly Val Phe lie Gly Ala Phe Ser Tyr Gly Tyr Gly Thr Gly Ala Asp 
275 280 285 

Thr Asn Gly Phe Gly Ala Thr Gly Ser Gin Thr Ser Val Leu Ser Gly 
290 295 300 

Arg Leu Ser Tyr Phe Tyr Gly Leu Glu Gly Pro Ser Val Thr Val Asp 
305 310 315 320 

Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin Ser 
325 330 335 

Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr Val 
340 345 350 

Met Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly Leu 
355 360 365 

Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly Thr 
370 375 380 

Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser Asp 
385 390 395 400 

Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser Ala 
405 410 415 
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Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly Pro 
420 425 430 

Ser Gin Glu Arg Val lie His Gin Ala Leu Ala Asn Ala Lys Leu Thr 
435 440 445 

Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu 
450 455 460 

Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Asp 
465 470 475 480 

Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly His 
485 490 495 

Ala Gin Ala Ala Ser Gly Val Ala Gly lie lie Lys Met Val Gin Ala 
500 505 510 

He Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro Ser 
515 520 525 

Pro His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser Ala 
530 535 540 

Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Ala Val Ser Ser 
545 550 555 560 

Phe Gly Val Ser Gly Thr Asn Ala His lie He Leu Glu Ala Gly Pro 
565 570 575 

Val Lys Thr Gly Pro Val Glu Ala Gly Ala He Glu Ala Gly Pro Val 
580 585 590 

Glu Val Gly Pro Val Glu Ala Gly Pro Leu Pro Ala Ala Pro Pro Ser 
595 600 605 

Ala Pro Gly Glu Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro Glu 
610 615 620 

Ala Leu Asp Glu Gin He Gly Arg Leu Arg Ala Tyr Leu Asp Thr Gly 
625 630 635 640 

Pro Gly Val Asp Arg Ala Ala Val Ala Gin Thr Leu Ala Arg Arg Thr 
645 650 655 

His Phe Thr His Arg Ala Val Leu Leu Gly Asp Thr Val He Gly Ala 
660 665 670 

Pro Pro Ala Asp Gin Ala Asp Glu Leu Val Phe Val Tyr Ser Gly Gin 
675 680 685 

Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Ala Ala Phe Pro 
690 695 700 

Val Phe Ala Asp Ala Trp His Asp Ala Leu Arg Arg Leu Asp Asp Pro 
705 710 715 720 
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Asp Pro His Asp Pro Thr Arg Ser Gin His Thr Leu Phe Ala His Gin 
725 730 735 

Ala Ala Phe Thr Ala Leu Leu Arg Ser Trp Asp lie Thr Pro His Ala 
740 745 750 

Val lie Gly His Ser Leu Gly Glu lie Thr Ala Ala Tyr Ala Ala Gly 
755 760 765 

lie Leu Ser Leu Asp Asp Ala Cys Thr Leu lie Thr Thr Arg Ala Arg 
770 775 780 

Leu Met His Thr Leu Pro Pro Pro Gly Ala Met Val Thr Val Leu Thr 
785 790 795 800 

Ser Glu Glu Glu Ala Arg Gin Ala Leu Arg Pro Gly Val Glu lie Ala 
805 810 815 

Ala Val Phe Gly Pro His Ser Val Val Leu Ser Gly Asp Glu Asp Ala 
^. 820 825 830 

w Val Leu Asp Val Ala Gin Arg Leu Gly lie His His Arg Leu Pro Ala 
'"^ 835 840 845 

f 

Q Pro His Ala Gly His Ser Ala His Met Glu Pro Val Ala Ala Glu Leu 
y 850 855 860 

Mi 

Leu Ala Thr Thr Arg Glu Leu Arg Tyr Asp Arg Pro His Thr. Ala lie 
_ 865 870 875 880 

fl 

^ Pro Asn Asp Pro Thr Thr Ala Glu Tyr Trp Ala Glu Gin Val Arg Asn 
yi 885 890 895 

ry 

Pro Val Leu Phe His Ala His Thr Gin Arg Tyr Pro Asp Ala Val Phe 
a 900 905 910 

Val Glu lie Gly Pro Gly Gin Asp Leu Ser Pro Leu Val Asp Gly lie 
915 920 925 

Ala Leu Gin Asn Gly Thr Ala Asp Glu Val His Ala Leu His Thr Ala 
930 935 940 

Leu Ala Arg Leu Phe Thr Arg Gly Ala Thr Leu Asp Trp Ser Arg lie 
945 950 955 960 

Leu Gly Gly Ala Ser Arg His Asp Pro Asp Val Pro Ser Tyr Ala Phe 
965 970 975 

Gin Arg Arg Pro Tyr Trp He Glu Ser Ala Pro Pro Ala Thr Ala Asp 
980 985 990 

Ser Gly His Pro Val Leu Gly Thr Gly Val Ala Val Ala Gly Ser Pro 
995 1000 1005 

Gly Arg Val Phe Thr Gly Pro Val Pro Ala Gly Ala Asp Arg Ala Val 
1010 1015 1020 
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Phe lie Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Thr Asp Cys Ala 
025 1030 1035 104 

Thr Val Glu Gin Leu Asp Val Thr Ser Val Pro Gly Gly Ser Ala Arg 
1045 1050 1055 

Gly Arg Ala Thr Ala Gin Thr Trp Val Asp Glu Pro Ala Ala Asp Gly 
1060 1065 1070 

Arg Arg Arg Phe Thr Val His Thr Arg Val Gly Asp Ala Pro Trp Thr 
1075 1080 1085 

Leu His Ala Glu Gly Val Leu Arg Pro Gly Arg Val Pro Gin Pro Glu 
1090 1095 1100 

Ala Val Asp Thr Ala Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly 
105 1110 1115 112 

Leu Pro Gly Ala Trp Arg Arg Ala Asp Gin Val Phe Val Glu Ala Glu 
1125 1130 1135 

Val Asp Ser Pro Asp Gly Phe Val Ala His Pro Asp Leu Leu Asp Ala 
1140 1145 1150 

Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Thr Gly Trp Arg 
1155 1160 1165 

Asp Leu Ala Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu 
1170 : 1175 1180 

Thr Arg Arg Asp Ser Gly Val Val Glu Leu Ala Ala Phe Asp Gly Ala 
185 1190 1195 120 

Gly Met Pro Val Leu Thr Ala Glu Ser Val Thr Leu Gly Glu Val Ala 
1205 1210 1215 

Ser Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu Arg Leu Glu Trp 
1220 1225 1230 

Leu Pro Val Ala Glu Ala His Tyr Asp Gly Ala Asp Glu Leu Pro Glu 
1235 1240 1245 

Gly Tyr Thr Leu lie Thr Ala Thr His Pro Asp Asp Pro Asp Asp Pro 
1250 1255 1260 

Thr Asn Pro His Asn Thr Pro Thr Arg Thr His Thr Gin Thr Thr Arg 
265 1270 1275 128 

Val Leu Thr Ala Leu Gin His His Leu lie Thr Thr Asn His Thr Leu 
1285 1290 1295 

lie Val His Thr Thr Thr Asp Pro Pro Gly Ala Ala Val Thr Gly Leu 
1300 1305 1310 



Thr Arg Thr Ala Gin Asn Glu His Pro Gly Arg lie His Leu lie Glu 
1315 1320 1325 
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Thr His His Pro His Thr Pro Leu Pro Leu Thr Gin Leu Thr Thr Leu 

1330 1335 1340 

His Gin Pro His Leu Arg Leu Thr Asn Asn Thr Leu His Thr Pro His 

345 1350 1355 136 

Leu Thr Pro He Thr Thr His His Asn Thr Thr Thr Thr Thr Pro Asn 
1365 1370 1375 

Thr Pro Pro Leu Asn Pro Asn His Ala He Leu He Thr Gly Gly Ser 
1380 1385 1390 

Gly Thr Leu Ala Gly He Leu Ala Arg His Leu Asn His Pro His Thr 
1395 1400 1405 

Tyr Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr Pro Gly Thr His 
1410 1415 1420 

He Pro Cys Asp Leu Thr Asp Pro Thr Gin He Thr Gin Ala Leu Thr 
425 1430 1435 144 

His He Pro Gin Pro Leu Thr Gly He Phe His Thr Ala Ala Thr Leu 
1445 1450 1455 

Asp Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin His Leu Thr Thr Thr 
1460 1465 1470 

Leu Gin Pro Lys Ala Asp Ala Ala Trp His Leu His His His Thr Gin 
1475 1480 1485 



Asn Gin Pro Leu Thr His Phe Val 
1490 1495 

Leu Gly Ser Pro Gly Gin Ala Asn 
505 1510 



Leu Tyr Ser Ser Ala Ala Ala Thr 
1500 

Tyr Ala Ala Ala Asn Ala Phe Leu 

1515 152 



Asp Ala Leu Ala Thr His Arg His Thr Gin Gly Gin Pro Ala Thr Thr 
1525 1530 1535 

He Ala Trp Gly Met Trp His Thr Thr Thr Thr Leu Thr Ser Gin Leu 
1540 1545 1550 

Thr Asp Ser Asp Arg Asp Arg He Arg Arg Gly Gly Phe Leu Pro He 
1555 1560 1565 

Ser Asp Asp Glu Gly Met 
1570 



<210> 26 
<211> 4674 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence : DNA encoding synthetic 
PKS synthase fragment 
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<220> 

<221> CDS 

<222> (3) . . (4673) 

<400> 26 

gc atg egg ctg tac gag gcg gca egg cgc acc gga agt ccc gtg gtg 

Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val 

1 5 10 15 

gtg gcg gcc gcg etc gac gac gcg ccg gac gtg ccg ctg ctg cgc ggg 
Val Ala Ala Ala Leu Asp Asp Ala Pro Asp Val Pro Leu Leu Arg Gly 
20 25 30 

ctg egg cgt acg acc gtc egg cgt gcc gee gtc egg gaa cgc tet etc 
Leu Arg Arg Thr Thr Val Arg Arg Ala Ala Val Arg Glu Arg Ser Leu 
35 40 45 

gcc gac cgc teg ccg tgc tge ccg acg acg age gcg ccg acg cet ccc 
Ala Asp Arg Ser Pro Cys Cys Pro Thr Thr Ser Ala Pro Thr Pro Pro 
50 55 60 

teg cgt teg tec tgg aac age ace gee acc gtg etc gge eac ctg ggc 
Ser Arg Ser Ser Trp Asn Ser Thr Ala Thr Val Leu Gly His Leu Gly 
65 70 75 

gee gaa gac ate ccg gcg acg acg acg ttc aag gaa etc ggc ate gac 
Ala Glu Asp lie Pro Ala Thr Thr Thr Phe Lys Glu Leu Gly lie Asp 
80 85 . . 90 95 

teg etc acc gcg gtc cag ctg cgc aac gcg ctg acc acg gcg acc ggc 
Ser Leu Thr Ala Val Gin Leu Arg Asn Ala Leu Thr Thr Ala Thr Gly 
100 105 110 

gta cgc etc aac gcc aca gcg gtc ttc gac ttt ccg acg ccg cgc gcg 
Val Arg Leu Asn Ala Thr Ala Val Phe Asp Phe Pro Thr Pro Arg Ala 
115 120 125 

etc gcc gcg aga etc ggc gac gag ctg gcc ggt ace cgc gcg ccc gtc 
Leu Ala Ala Arg Leu Gly Asp Glu Leu Ala Gly Thr Arg Ala Pro Val 
130 135 140 

geg gee egg acc gcg gee acc gcg gcc gcg eac gac gaa ccg ctg gcg 
Ala Ala Arg Thr Ala Ala Thr Ala Ala Ala His Asp Glu Pro Leu Ala 
145 150 155 

ate gtg gge atg gee tgc cgt ctg ccg ggc ggg gtc gcg teg eca cag 
lie Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Ala Ser Pro Gin 
160 165 170 175 

gag ctg tgg cgt etc gtc gcg tec ggc acc gac gcc ate acg gag ttc 
Glu Leu Trp Arg Leu Val Ala Ser Gly Thr Asp Ala lie Thr Glu Phe 
180 185 190 

ccc gcg gac cgc ggc tgg gac gtg gac geg etc tac gac ccg gac ccc 
Pro Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro 
195 200 205 



gac gcg ate ggc aag ace ttc gtc egg eac ggc ggc ttc etc gac ggt 
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Asp 


Ala 


He 


Gly 


Lys 


Thr 


Phe 


Val 


Arg 


His 


Gly 


Gly 


Phe 


Leu 


Asp 


Gly 










210 










215 










220 














acc 


yy 


ttc 


gac 


y^y 


aca 
y^y 


ttc 


ttc 


yyy 


ate 


age 


ccg 


cgc 


ydy 


gcc 


719 




Ala 


Thr 


Gly 


Phe 


Asp 


Ala 


Ala 


Phe 


Phe 


Gly 


He 


Ser 


Pro 


Arg 


Glu 


Ala 








225 










230 










235 
















gcc 


atcr 


gac 


ccg 


cag 


caa 


caa 
^y y 


ata 

y "-y 


etc 




ycty 


acg 


tec 


i-yy 


asa 
ydy 


767 




Leu 


Ala 


Met 


Asp 


Pro 


Gin 


Gin 


Arg 


Val 


Leu 


Leu 


Glu 


Thr 


Ser 


Trp 


Glu 






240 










24 5 










250 










255 






y 


t tc 


gaa 


age 


y ^y 


yy ^ 


ate 


ace 




gac 




y*-y 


L-yy 


aac 

yy ^ 


age 


gac 


815 




Ala 


Phe 


Glu 


Ser 


Ala 


Gly 


He 


Thr 


Pro 


Asp 


Ala 


Ala 


Arg 


Gly 


Ser 


Asp 














260 










265 










270 








acc 




gtg 


ttc 


ate 


ggc 


gcg 


ttc 


tec 


tac 


ggg 


tac 


ggc 


acg 


ggt 


gcg 


863 




Thr 


Gly 


Val 


Phe 


He 


Gly 


Ala 


Phe 


Ser 


Tyr 


Gly 


Tyr 


Gly 


Thr 


Gly 


Ala 












275 










280 










285 










9a t 


acc 


aac 


yy ^ 


ttc 


crcfc 

yy 


aca 
y *-y 


aca 


cracr 

yyy 


teg 


cag 


ace 


age 


y '-y 


etc 


tec 


911 




Asp 


Thr 


Asn 


Gly 


Phe 


Gly 


Ala 


Thr 


Gly 


Ser 


Gin 


Thr 


Ser 


Val 


Leu 


Ser 










290 










295 










300 












ggc 


cgc 


etc 


teg 


tac 


ttc 


tac 


ggt 


ctg 


gag 


ggc 


cct 


teg 


gtc 


acg 


gtc 


959 


± 

a 
m 


Gly 


Arg 


Leu 


Ser 


Tyr 


Phe 


Tyr 


Gly 


Leu 


Glu 


Gly 


Pro 


Ser 


Val 


Thr 


Val 






305 










310 










315 














gac 


acc 


gcc 


tac 
"-y ^ 


tea 


tea 


tea 


eta 


ate 


gcc 


^ uy 


cac 


cag 


gea 


aaa 
yyy 


cag 


1007 


m 


Asp 


Thr 


Ala 


Cys 


Ser 


Ser 


Ser 


Leu 


Val 


Ala 


Leu 


His 


Gin 


Ala 


Gly 


Gin 






320 










325 










330 










335 










































tec 


ctg 


cgc 


teg 


ggc 


gaa 


tgc 


teg 


etc 


gcc 


ctg 


gtc 


ggc 


ggt 


gtc 


acg 


1055 


S 


Ser 


Leu 


Arg 


Ser 


Gly 


Glu 


Cys 


Ser 


Leu 


Ala 


Leu 


Val 


Gly 


Gly 


Val 


Thr 














340 










345 










350 






a 


gtg 


atg 


gcg 


teg 


ecc 


ggc 


gga 


ttc 


gtc 


gag 


ttc 


tec 


egg 


cag 


cgc 


ggg 


1103 




Val 


Met 


Ala 


Ser 


Pro 


Gly 


Gly 


Phe 


Val 


Glu 


Phe 


Ser 


Arg 


Gin 


Arg 


Gly 





355 360 365 

etc gcg ccg gac ggg egg gcg aag gcg ttc ggc gcg ggc gcg gac ggt 1151 

Leu Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly 
370 375 380 

acg age ttc gee gag ggc gee ggt gcc ctg gtg gtc gag egg etc tec 1199 

Thr Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser 
385 390 395 

gac gcg gag cgc cac ggc cac ace gtc etc gee etc gta cgc ggc tec 1247 

Asp Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser 

400 405 410 415 

gcg get aac tee gac ggc gcg teg aac ggt ctg teg gcg ccg aac ggc 1295 

Ala Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly 
420 425 430 
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ccc 


tec 




Pro 


Ser 




acc 


ccc 




Thr 


Pro 




etc 


gge 




Leu 


Gly 






465 




gac 


egg 




Asp 


Arg 




480 






cac 


gee 




His 


Ala 




gcc 


ate 


• ^ 


Ala 


He 














Q 


teg 


ecg 


5 , ' 

yd 


Ser 


Pro 








m 








gcc 


egg 




Ala 


Arg 


ff* 




545 


m 


tec 


ttc 


Ser 


Phe 


P 


560 












ccc 


ccc 




Pro 


Pro 




tgg 


gtg 




Trp 


Val 




cae 


gag 




His 


Glu 




atg 


egg 




Met 


Arg 






625 




eac 


cgt 



His Arg 
640 



cag gaa ege gte 
Gin Glu Arg Val 
435 

gee gat gtc gac 
Ala Asp Val Asp 
450 

gae ccc ate gag 
Asp Pro He Glu 



geg acg ccc etg 
Ala Thr Pro Leu 
485 

cag gee geg tea 
Gin Ala Ala Ser 
500 

egg cae ggg gaa 
Arg His Gly Glu 
515 

cac gte gae tgg 
His Val Asp Trp 
530 

eeg tgg ecg ggg 
Pro Trp Pro Gly 



ggg ate agt gge 
Gly He Ser Gly 
565 

act cag cet geg 
Thr Gin Pro Ala 
580 

eeg ttg gtg att 
Pro Leu Val He 
595 

gge egg ttg cgt 
Gly Arg Leu Arg 
610 

get gtg gea teg 
Ala Val Ala Ser 



gee gtg etg etg 
Ala Val Leu Leu 
645 



ate eac cag gee 
He His Gin Ala 
440 

geg gte gag geg 
Ala Val Glu Ala 
455 

geg cag geg etg 
Ala Gin Ala Leu 
470 

etg etc gge teg 
Leu Leu Gly Ser 



ggg gte gcc ggg 
Gly Val Ala Gly 
505 

etg ecg ecg aca 
Leu Pro Pro Thr 
520 

acg gcc ggt gee 
Thr Ala Gly Ala 
535 

ace ggt cge cet 
Thr Gly Arg Pro 
550 

ace aae gcc cac 
Thr Asn Ala His 



gac aae geg gtg 
Asp Asn Ala Val 
585 

teg gee agg ace 
Ser Ala Arg Thr 
600 

geg tat etg geg 
Ala Tyr Leu Ala 
615 

acg etg geg atg 
Thr Leu Ala Met 
630 

gga gat gac ace 
Gly Asp Asp Thr 



etc geg aae geg 
Leu Ala Asn Ala 
445 

eac gge ace gge 
His Gly Thr Gly 
460 

etc geg acg tac 
Leu Ala Thr Tyr 
475 

etg aag teg aae 
Leu Lys Ser Asn 
490 

ate ate aag atg 
He He Lys Met 



etg cac geg gac 
Leu His Ala Asp 
525 

gtc gag etc etg 
Val Glu Leu Leu 
540 

agg egg gea gge 
Arg Arg Ala Gly 
555 

gtc ate etg gaa 
Val He Leu Glu 
570 

ate gag egg gea 
He Glu Arg Ala 



cag teg get ttg 
Gin Ser Ala Leu 
605 

geg teg ccc ggg 
Ala Ser Pro Gly 
620 

aca egg teg gtg 
Thr Arg Ser Val 
635 

gtc acc gge acc 
Val Thr Gly Thr 
650 



aaa etc 1343 
Lys Leu 



ace cge 1391 
Thr Arg 



gga cag 14 3 9 
Gly Gin 



ate ggg 14 87 
He Gly 
495 

gtg cag 153 5 

Val Gin 

510 

gag ecg 1583 
Glu Pro 



acg teg 1631 
Thr Ser 



gtg teg 1679 
Val Ser 



age gea 172 7 
Ser Ala 
575 

ecg gag 1775 

Pro Glu 

590 

act gag 1823 
Thr Glu 



gtg gat 1871 
Val Asp 



ttc gag 1919 
Phe Glu 



get gtg 1967 
Ala Val 
655 



92 



tct gac cct egg gcg gtg ttc gtc ttc ccg gga cag ggg teg cag cgt 2015 
Ser Asp Pro Arg Ala Val Phe Val Phe Pro Gly Gin Gly Ser Gin Arg 
660 665 670 

get ggc atg ggt gag gaa ctg gcc gcc gcg ttc ccc gtc ttc gcg egg 2063 
Ala Gly Met Gly Glu Glu Leu Ala Ala Ala Phe Pro Val Phe Ala Arg 
675 680 685 

ate cat cag cag gtg tgg gac ctg etc gat gtg ccc gat ctg gag gtg 2111 
lie His Gin Gin Val Trp Asp Leu Leu Asp Val Pro Asp Leu Glu Val 
690 695 700 

aac gag acc ggt tac gcc cag ccg gcc ctg ttc gea atg cag gtg get 215 9 
Asn Glu Thr Gly Tyr Ala Gin Pro Ala Leu Phe Ala Met Gin Val Ala 
705 710 715 

ctg ttc ggg ctg ctg gaa teg tgg ggt gta ega ccg gac gcg gtg ate 2207 
Leu Phe Gly Leu Leu Glu Ser Trp Gly Val Arg Pro Asp Ala Val lie 
720 725 730 735 

O ggc eat teg gtg ggt gag ctt gcg get gcg tat gtg tec ggg gtg tgg 2255 
SU Gly His Ser Val Gly Glu Leu Ala Ala Ala Tyr Val Ser Gly Val Trp 

740 745 750 

Q teg ttg gag gat gee tgc act ttg gtg teg gcg egg get cgt ctg atg 2 3 03 

III Ser Leu Glu Asp Ala Cys Thr Leu Val Ser Ala Arg Ala Arg Leu Met 
755 760 765 

ffl - ■ 

cag get ctg ccc gcg ggt ggg gtg atg gtc get gtc ccg gtc teg gag 2351 

Gin Ala Leu Pro Ala Gly Gly Val Met Val Ala Val Pro Val Ser Glu 
770 775 780 



rU gat gag gcc egg gee gtg ctg ggt gag ggt gtg gag ate gcc gcg gtc 2 3 99 

Asp Glu Ala Arg Ala Val Leu Gly Glu Gly Val Glu lie Ala Ala Val 
£3 785 790 795 



aac ggc ccg teg teg gtg gtt etc tec ggt gat gag gee gcc gtg ctg 2447 
Asn Gly Pro Ser Ser Val Val Leu Ser Gly Asp Glu Ala Ala Val Leu 
800 805 810 815 

cag gcc gcg gag ggg ctg ggg aag tgg acg egg ctg gcg acc age eac 2495 
Gin Ala Ala Glu Gly Leu Gly Lys Trp Thr Arg Leu Ala Thr Ser His 
820 825 830 

gcg ttc eat tec gcc cgt atg gaa ccc atg ctg gag gag ttc egg gcg 2543 
Ala Phe His Ser Ala Arg Met Glu Pro Met Leu Glu Glu Phe Arg Ala 
835 840 845 

gtc gee gaa ggc ctg ace tac egg acg ccg cag gtc tec atg gcc gtt 2591 
Val Ala Glu Gly Leu Thr Tyr Arg Thr Pro Gin Val Ser Met Ala Val 
850 855 860 



ggt gat cag gtg ace acc get 
Gly Asp Gin Val Thr Thr Ala 
865 870 



gag tac 
Glu Tyr 



tgg gtg egg cag gtc egg gac 
Trp Val Arg Gin Val Arg Asp 
875 



2639 
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acg gtc egg ttc ggc gag cag gtg gcc teg tac gag gac gcc gtg ttc 2 687 
Thr Val Arg Phe Gly Glu Gin Val Ala Ser Tyr Glu Asp Ala Val Phe 
880 885 ^ 890 895 

gtc gag ctg ggt gcc gac egg tea ctg gee ege ctg gtc gac ggt gtc 2735 
Val Glu Leu Gly Ala Asp Arg Ser Leu Ala Arg Leu Val Asp Gly Val 
900 905 910 

gcg atg ctg cae ggc gae cac gaa ate cag gcc gcg ate ggc gcc ctg 2783 
Ala Met Leu His Gly Asp His Glu lie Gin Ala Ala lie Gly Ala Leu 
915 920 925 

gcc cae ctg tat gtc aac ggc gtc acg gtc gae tgg ecc gcg etc ctg 2831 
Ala His Leu Tyr Val Asn Gly Val Thr Val Asp Trp Pro Ala Leu Leu 
930 935 940 

ggc gat get ccg gca aca egg gtg ctg gac ett ccg aca tac gee ttc 2879 
Gly Asp Ala Pro Ala Thr Arg Val Leu Asp Leu Pro Thr Tyr Ala Phe 
945 950 955 

O cag cae cag ege tac tgg etc gag teg get ecc ccg gcc acg gee gae 2927 
Gin His Gin Arg Tyr Trp Leu Glu Ser Ala Pro Pro Ala Thr Ala Asp 
960 965 970 975 

Q teg ggc cae cec gtc etc ggc ace gga gtc gcc gtc gcc ggg teg ccg 2 975 
Ser Gly His Pro Val Leu Gly Thr Gly Val Ala Val Ala Gly Ser Pro 

r," 980 .985 990 

r— 

ggc egg gtg ttc acg ggt ecc gtg ecc gee ggt gcg gac ege gcg gtg 3 023 
Gly Arg Val Phe Thr Gly Pro Val Pro Ala Gly Ala Asp Arg Ala Val 
995 1000 1005 

fIJ ttc ate gee gaa ctg gcg etc gee gee gcc gac gee ace gac tge gee 3071 
SJ Phe lie Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Thr Asp Cys Ala 
Q 1010 1015 1020 

acg gtc gaa cag etc gac gtc ace tec gtg ecc ggc gga tec gcc ege 3119 
Thr Val Glu Gin Leu Asp Val Thr Ser Val Pro Gly Gly Ser Ala Arg 
1025 1030 1035 

ggc agg gee ace gcg cag acc tgg gtc gat gaa cec gee gcc gac ggg 3167 
Gly Arg Ala Thr Ala Gin Thr Trp Val Asp Glu Pro Ala Ala Asp Gly 
1040 1045 1050 1055 

egg ege ege ttc acc gtc cae acc ege gtc ggc gae gcc ccg tgg acg 3215 
Arg Arg Arg Phe Thr Val His Thr Arg Val Gly Asp Ala Pro Trp Thr 
1060 1065 1070 

ctg cac gee gag ggg gtt etc ege cec ggc cgc gtg cec cag ecc gaa 3263 
Leu His Ala Glu Gly Val Leu Arg Pro Gly Arg Val Pro Gin Pro Glu 
1075 1080 1085 

gee gtc gae acc gee tgg cec ccg ccg ggc gcg gtg ecc gcg gae ggg 3 311 
Ala Val Asp Thr Ala Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly 
1090 1095 1100 
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ctg ccc ggg gcg tgg cga cgc gcg gac cag gtc ttc gtc gaa gcc gaa 33 59 
Leu Pro Gly Ala Trp Arg Arg Ala Asp Gin Val Phe Val Glu Ala Glu 
1105 1110 1115 

gtc gac age cot gac ggc ttc gtg gca cac ccc gac ctg etc gac gcg 3407 
Val Asp Ser Pro Asp Gly Phe Val Ala His Pro Asp Leu Leu Asp Ala 
1120 1125 1130 1135 

gtc ttc tec gcg gtc ggc gac ggg age cgc cag ccg acc gga tgg cgc 3455 
Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Thr Gly Trp Arg 
1140 1145 1150 

gac etc gcg gtg eac gcg teg gac gee acc gtg ctg cgc gcc tge etc 3503 
Asp Leu Ala Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu 
1155 1160 1165 

acc cgc cgc gac agt ggt gtc gtg gag etc gee gcc ttc gac ggt gee 3551 
Thr Arg Arg Asp Ser Gly Val Val Glu Leu Ala Ala Phe Asp Gly Ala 
1170 1175 1180 

O gga atg ccg gtg etc acc gcg gag teg gtg acg ctg ggc gag gtc gcg 3599 
Gly Met Pro Val Leu Thr Ala Glu Ser Val Thr Leu Gly Glu Val Ala 
1185 1190 1195 

Q teg gca ggc gga tec gac gag teg gac ggt ctg ett egg ett gag tgg 3647 
2 Ser Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu Arg Leu Glu Trp 
r: 1200 1205 1210 1215 



m 



ttg ccg gtg gcg gag gee cac tac gac ggt gee gac gag ctg ccc gag 3695 

Leu Pro Val Ala Glu Ala His Tyr Asp Gly Ala Asp Glu Leu Pro Glu 
1220 1225 1230 

ggc tac ace etc ate acc gcc aca cac ccc gac gac ccc gac gac ccc 3743 

Gly Tyr Thr Leu He Thr Ala Thr His Pro Asp Asp Pro Asp Asp Pro 
1235 1240 1245 

ace aac ece eac aac aca ccc aca ege ace cac aca caa ace aca cgc 3791 

Thr Asn Pro His Asn Thr Pro Thr Arg Thr His Thr Gin Thr Thr Arg 
1250 1255 1260 

gtc etc acc gee etc caa eac cac etc ate acc ace aac eac acc etc 3839 

Val Leu Thr Ala Leu Gin His His Leu He Thr Thr Asn His Thr Leu 
1265 1270 1275 

ate gtc cac ace acc ace gac ece cea ggc gcc gee gtc acc ggc etc 3887 

He Val His Thr Thr Thr Asp Pro Pro Gly Ala Ala Val Thr Gly Leu 
1280 1285 1290 1295 

acc ege acc gca caa aac gaa cac ccc ggc cgc ate cac etc ate gaa 3935 

Thr Arg Thr Ala Gin Asn Glu His Pro Gly Arg He His Leu He Glu 
1300 1305 1310 

ace eac cac ece eac acc eca etc ccc etc acc caa etc ace ace etc 3983 

Thr His His Pro His Thr Pro Leu Pro Leu Thr Gin Leu Thr Thr Leu 
1315 1320 1325 
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cac caa ccc cac eta cgc etc acc aac aac acc etc cac ace ccc cac 4031 
His Gin Pro His Leu Arg Leu Thr Asn Asn Thr Leu His Thr Pro His 
1330 1335 1340 

etc acc ccc ate acc acc cac cac aac ace acc aca acc acc ccc aac 4079 
Leu Thr Pro He Thr Thr His His Asn Thr Thr Thr Thr Thr Pro Asn 
1345 1350 1355 

ace eca ccc etc aac ccc aac cac gee ate etc ate ace ggc ggc tec 4127 
Thr Pro Pro Leu Asn Pro Asn His Ala He Leu He Thr Gly Gly Ser 
1360 1365 1370 1375 

ggc ace etc gee ggc ate etc gee cgc cac etc aac cac ccc cac acc 4175 
Gly Thr Leu Ala Gly He Leu Ala Arg His Leu Asn His Pro His Thr 
1380 1385 1390 

tac etc etc tee cgc aca eca eca ccc ccc acc aca ccc ggc ace cac 4223 
Tyr Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr Pro Gly Thr His 
1395 1400 1405 

ate ccc tgc gae etc acc gac ccc acc caa ate acc caa gee etc acc 4271 
He Pro Cys Asp Leu Thr Asp Pro Thr Gin He Thr Gin Ala. Leu Thr 
1410 1415 1420 



p cac ata eca caa ccc etc acc ggc ate ttc cac acc gee gee acc etc 4319 
His He Pro Gin Pro Leu Thr Gly He Phe His Thr Ala Ala Thr Leu 
1425 1430 1435 



gac gac gee acc etc ace aac etc acc ccc caa cac etc acc acc acc 4367 
^ Asp Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin His Leu Thr Thr Thr 
1440 1445 1450 1455 

ttl 

ftl etc caa ccc aaa gee gae gee gee tgg cac etc cac cac cac ace caa 4415 

Leu Gin Pro Lys Ala Asp Ala Ala Trp His Leu His His His Thr Gin 
Q 1460 1465 1470 

^ aac caa ccc etc acc cac ttc gtc etc tac tec age gee gee gee acc 4463 
Asn Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala Ala Thr 
1475 1480 1485 

etc ggc age ccc ggc caa gee aac tac gee gee gee aac gee ttc etc 4511 
Leu Gly Ser Pro Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Phe Leu 
1490 1495' 1500 

gac gee etc gee ace cac cgc cac ace caa gga caa ccc gee acc ace 4559 
Asp Ala Leu Ala Thr His Arg His Thr Gin Gly Gin Pro Ala Thr Thr 
1505 1510 1515 

ate gee tgg ggc atg tgg cac ace ace acc aca etc ace age caa etc 4607 
He Ala Trp Gly Met Trp His Thr Thr Thr Thr Leu Thr Ser Gin Leu 
1520 1525 1530 1535 

acc gac age gae cgc gac cgc ate cgc cgc ggc ggc ttc ctg ccg ate 4655 
Thr Asp Ser Asp Arg Asp Arg He Arg Arg Gly Gly Phe Leu Pro He 
1540 1545 1550 
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teg gac gac gag ggc atg c 4 674 

Ser Asp Asp Glu Gly Met 
1555 



<210> 27 
<211> 1557 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKiS 
synthase fragment 

<400> 27 

Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val Val 
15 10 15 

Ala Ala Ala Leu Asp Asp Ala Pro Asp Val Pro Leu Leu Arg Gly Leu 
^ 20 25 30 

^fl Arg Arg Thr Thr Val Arg Arg Ala Ala Val Arg Glu Arg Ser Leu Ala 
35 40 45 

■Q Asp Arg Ser Pro Cys Cys Pro Thr Thr Ser Ala Pro Thr Pro Pro Ser 
s.j 50 55 60 

Ms 

Arg Ser Ser Trp Asn Ser Thr Ala Thr Val Leu Gly His Leu Gly Ala 
^'65 70 75 80 

y .Glu Asp lie Pro Ala Thr Thr Thr Phe Lys Glu Leu Gly lie Asp Ser 
^ 85 90 95 

ni 

Leu Thr Ala Val Gin Leu Arg Asn Ala Leu Thr Thr Ala Thr Gly Val 
p 100 105 110 

Arg Leu Asn Ala Thr Ala Val Phe Asp Phe Pro Thr Pro Arg Ala Leu 
115 120 125 

Ala Ala Arg Leu Gly Asp Glu Leu Ala Gly Thr Arg Ala Pro Val Ala 
130 135 140 

Ala Arg Thr Ala Ala Thr Ala Ala Ala His Asp Glu Pro Leu Ala lie 
145 150 155 160 

Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Ala Ser Pro Gin Glu 
165 170 175 

Leu Trp Arg Leu Val Ala Ser Gly Thr Asp Ala lie Thr Glu Phe Pro 
180 185 190 

Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro Asp 
195 200 205 

Ala lie Gly Lys Thr Phe Val Arg His Gly Gly Phe Leu Asp Gly Ala 
210 215 220 
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Thr Gly Phe Asp Ala Ala Phe Phe Gly lie Ser Pro Arg Glu Ala Leu 
225 230 235 240 

Ala Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser Trp Glu Ala 
245 250 255 

Phe Glu Ser Ala Gly lie Thr Pro Asp Ala Ala Arg Gly Ser Asp Thr 
260 265 270 

Gly Val Phe lie Gly Ala Phe Ser Tyr Gly Tyr Gly Thr Gly Ala Asp 
275 280 285 

Thr Asn Gly Phe Gly Ala Thr Gly Ser Gin Thr Ser Val Leu Ser Gly 
290 295 300 

Arg Leu Ser Tyr Phe Tyr Gly Leu Glu Gly Pro Ser Val Thr Val Asp 
305 310 315 320 

Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin Ser 
325 330 335 

Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr Val 
340 345 350 

Met Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly Leu 
355 360 365 

Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly Thr 
370 375 . . 380 

Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser Asp 
385 390 395 400 

Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser Ala 
405 410 415 

Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly Pro 
420 425 430 

Ser Gin Glu Arg Val lie His Gin Ala Leu Ala Asn Ala Lys Leu Thr 
435 440 445 

Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu 
450 455 460 

Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Asp 
465 470 475 480 

Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly His 
485 490 495 

Ala Gin Ala Ala Ser Gly Val Ala Gly lie lie Lys Met Val Gin Ala 
500 505 510 



lie Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro Ser 
515 520 525 
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Pro His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser Ala 
530 535 540 

Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ser 
545 550 555 560 

Phe Gly lie Ser Gly Thr Asn Ala His Val lie Leu Glu Ser Ala Pro 
565 570 575 

Pro Thr Gin Pro Ala Asp Asn Ala Val lie Glu Arg Ala Pro Glu Trp 
580 585 590 

Val Pro Leu Val lie Ser Ala Arg Thr Gin Ser Ala Leu Thr Glu His 
595 600 605 

Glu Gly Arg Leu Arg Ala Tyr Leu Ala Ala Ser Pro Gly Val Asp Met 
610 615 620 

Arg Ala Val Ala Ser Thr Leu Ala Met Thr Arg Ser Val Phe Glu His 
625 630 635 640 

'% Arg Ala Val Leu Leu Gly Asp Asp Thr Val Thr Gly Thr Ala Val Ser 

'"O 645 650 655 

%cj Asp Pro Arg Ala Val Phe Val Phe Pro Gly Gin Gly Ser Gin Arg Ala 
W 660 665 670 

.M» 

Gly Met Gly Glu Glu Leu Ala Ala Ala Phe Pro Val Phe Ala Arg lie 
J 675 680 685 

His Gin Gin Val Trp Asp Leu Leu Asp Val Pro Asp Leu Glu Val Asn 
^ 690 695 700 

Glu Thr Gly Tyr Ala. Gin Pro Ala Leu Phe Ala Met Gin Val Ala Leu 
O 705 710 715 720 

Phe Gly Leu Leu Glu Ser Trp Gly Val Arg Pro Asp Ala Val He Gly 
725 730 735 

His Ser Val Gly Glu Leu Ala Ala Ala Tyr Val Ser Gly Val Trp Ser 
740 745 750 

Leu Glu Asp Ala Cys Thr Leu Val Ser Ala Arg Ala Arg Leu Met Gin 
755 760 765 

Ala Leu Pro Ala Gly Gly Val Met Val Ala Val Pro Val Ser Glu Asp 
770 775 780 

Glu Ala Arg Ala Val Leu Gly Glu Gly Val Glu He Ala Ala Val Asn 
785 790 795 800 

Gly Pro Ser Ser Val Val Leu Ser Gly Asp Glu Ala Ala Val Leu Gin 
805 810 815 



Ala Ala Glu Gly Leu Gly Lys Trp Thr Arg Leu Ala Thr Ser His Ala 
820 825 830 
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Phe His Ser Ala Arg Met Glu Pro Met Leu Glu Glu Phe Arg Ala Val 
835 840 845 

Ala Glu Gly Leu Thr Tyr Arg Thr Pro Gin Val Ser Met Ala Val Gly 
850 855 860 

Asp Gin Val Thr Thr Ala Glu Tyr Trp Val Arg Gin Val Arg Asp Thr 
865 870 875 880 

Val Arg Phe Gly Glu Gin Val Ala Ser Tyr Glu Asp Ala Val Phe Val 
885 890 895 

Glu Leu Gly Ala Asp Arg Ser Leu Ala Arg Leu Val Asp Gly Val Ala 
900 905 910 

Met Leu His Gly Asp His Glu He Gin Ala Ala He Gly Ala Leu Ala 
915 920 925 

His Leu Tyr Val Asn Gly Val Thr Val Asp Trp Pro Ala Leu Leu Gly 
930 935 940 

Asp Ala Pro Ala Thr Arg Val Leu Asp Leu Pro Thr Tyr Ala Phe Gin 
945 950 955 960 

His Gin Arg Tyr Trp Leu Glu Ser Ala Pro Pro Ala Thr Ala Asp Ser 
965 970 975 

Gly His Pro Val Leu Gly Thr Gly Val Ala Val Ala Gly Ser Pro Gly 
980 985 990 

Arg Val Phe Thr Gly Pro Val Pro Ala Gly Ala Asp Arg Ala Val Phe 
995 1000 1005 

He Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Thr Asp Cys Ala Thr 
1010 1015 1020 

Val Glu Gin Leu Asp Val Thr Ser Val Pro Gly Gly Ser Ala Arg Gly 
025 1030 1035 104 

Arg Ala Thr Ala Gin Thr Trp Val Asp Glu Pro Ala Ala Asp Gly Arg 
1045 1050 1055 

Arg Arg Phe Thr Val His Thr Arg Val Gly Asp Ala Pro Trp Thr Leu 
1060 1065 1070 

His Ala Glu Gly Val Leu Arg Pro Gly Arg Val Pro Gin Pro Glu Ala 
1075 1080 1085 

Val Asp Thr Ala Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly Leu 
1090 1095 1100 

Pro Gly Ala Trp Arg Arg Ala Asp Gin Val Phe Val Glu Ala Glu Val 
105 1110 1115 112 

Asp Ser Pro Asp Gly Phe Val Ala His Pro Asp Leu Leu Asp Ala Val 
1125 1130 1135 
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Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Thr Gly Trp Arg Asp 
1140 1145 1150 

Leu Ala Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu Thr 
1155 1160 1165 

Arg Arg Asp Ser Gly Val Val Glu Leu Ala Ala Phe Asp Gly Ala Gly 
1170 1175 1180 

Met Pro Val Leu Thr Ala Glu Ser Val Thr Leu Gly Glu Val Ala Ser 
185 1190 1195 120 

Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu Arg Leu Glu Trp Leu 
1205 1210 1215 

Pro Val Ala Glu Ala His Tyr Asp Gly Ala Asp Glu Leu Pro Glu Gly 
1220 1225 1230 

Tyr Thr Leu lie Thr Ala Thr His Pro Asp Asp Pro Asp Asp Pro Thr 
1235 1240 1245 

Asn Pro His Asn Thr Pro Thr Arg Thr His Thr Gin Thr Thr Arg Val 
1250 1255 1260 

Leu Thr Ala Leu Gin His His Leu lie Thr Thr Asn His Thr Leu lie 
265 1270 1275 128 

Val His Thr Thr Thr Asp Pro Pro Gly Ala Ala Val Thr Gly Leu Thr 
1285 1290 1295 

Arg Thr Ala Gin Asn Glu His Pro Gly Arg lie His Leu lie Glu Thr 
1300 1305 1310 

His His Pro His Thr Pro Leu Pro Leu Thr Gin Leu Thr Thr Leu His 
1315 1320 1325 

Gin Pro His Leu Arg Leu Thr Asn Asn Thr Leu His Thr Pro His Leu 
1330 1335 1340 

Thr Pro lie Thr Thr His His Asn Thr Thr Thr Thr Thr Pro Asn Thr 
345 1350 1355 136 

Pro Pro Leu Asn Pro Asn His Ala lie Leu lie Thr Gly Gly Ser Gly 
1365 1370 1375 

Thr Leu Ala Gly lie Leu Ala Arg His Leu Asn His Pro His Thr Tyr 
1380 1385 1390 

Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr Pro Gly Thr His lie 
1395 1400 1405 

Pro Cys Asp Leu Thr Asp Pro Thr Gin lie Thr Gin Ala Leu Thr His 
1410 1415 1420 



lie Pro Gin Pro Leu Thr Gly lie Phe His Thr Ala Ala Thr Leu Asp 
425 1430 1435 144 
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Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin His Leu Thr Thr Thr Leu 
1445 1450 1455 

Gin Pro Lys Ala Asp Ala Ala Trp His Leu His His His Thr Gin Asn 
1460 1465 1470 

Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala Ala Thr Leu 
1475 1480 1485 

Gly Ser Pro Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp 
1490 1495 1500 

Ala Leu Ala Thr His Arg His Thr Gin Gly Gin Pro Ala Thr Thr lie 
505 1510 1515 152 

; : Ala Trp Gly Met Trp His Thr Thr Thr Thr Leu Thr Ser Gin Leu Thr 

1525 1530 1535 

Asp Ser Asp Arg Asp Arg lie Arg Arg Gly Gly Phe Leu Pro lie Ser 
1540 1545 1550 

'-^ Asp Asp Glu Gly Met 
1555 



□ <210> 28 
m <211> 4767 
£^ <212> DNA 

ffj <213> Artificial Sequence 
<220> 

y <223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

ff! 

%J <220> 

p <221> CDS 

yi. <222> (3) . . (4766) 

<400> 28 

gc atg egg ctg tac gag gcg gca egg cgc acc gga agt ccc gtg gtg 4 7 

Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val 

15 10 15 

gtg gcg gcc gcg etc gac gac gcg ccg gac gtg ccg ctg ctg cgc ggg 95 
Val Ala Ala Ala Leu Asp Asp Ala Pro Asp Val Pro Leu Leu Arg Gly 
20 25 30 

ctg egg cgt aeg ace gtc egg cgt gee gcc gte egg gaa cgc tct etc 143 
Leu Arg Arg Thr Thr Val Arg Arg Ala Ala Val Arg Glu Arg Ser Leu 
35 40 45 

gcc gac cgc teg ccg tgc tgc ccg aeg aeg age gcg ccg aeg cct ccc 191 
Ala Asp Arg Ser Pro Cys Cys Pro Thr Thr Ser Ala Pro Thr Pro Pro 

50 55 , . 60 



teg cgt teg tec tgg aac age acc gee acc gtg etc ggc eac ctg ggc 
Ser Arg Ser Ser Trp Asn Ser Thr Ala Thr Val Leu Gly His Leu Gly 
65 70 75 



239 
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gcc gaa gac ate ccg gcg acg acg acg ttc aag gaa etc ggc ate gac 287 
Ala Glu Asp lie Pro Ala Thr Thr Thr Phe Lys Glu Leu Gly lie Asp 
80 85 90 95 

teg cte aee geg gtc cag ctg cge aae geg ctg aee aeg gcg ace ggc 335 
Ser Leu Thr Ala Val Gin Leu Arg Asn Ala Leu Thr Thr Ala Thr Gly 
100 105 110 

gta cge etc aae gee aea geg gtc ttc gac ttt ccg acg ccg cgc gcg 383 
Val Arg Leu Asn Ala Thr Ala Val Phe Asp Phe Pro Thr Pro Arg Ala 
115 120 125 

etc gcc gcg aga etc ggc gac gag ctg gcc ggt ace cgc gcg ccc gtc 431 
Leu Ala Ala Arg Leu Gly Asp Glu Leu Ala Gly Thr Arg Ala Pro Val 
130 135 140 

gcg gcc egg ace gcg gcc ace gcg gcc gcg cac gac gaa ccg ctg gcg 479 
Ala Ala Arg Thr Ala Ala Thr Ala Ala Ala His Asp Glu Pro Leu Ala 
145 150 155 

ate gtg ggc atg gcc tge egt ctg ccg ggc ggg gtc gcg teg eca cag 527 
He Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Ala Ser Pro Gin 
160 165 170 175 



B gag ctg tgg egt etc gtc gcg tec ggc ace gac gee ate acg gag ttc 575 
ijij Glu Leu Trp Arg Leu Val Ala Ser Gly Thr Asp Ala lie Thr Glu Phe 
U ISO 185 190 



CCC gcg gac cgc ggc tgg gac gtg gac gcg etc tac gac ccg gac ccc 623 
Pro Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro 
195 200 205 



ry gac gcg ate ggc aag ace ttc gtc egg cac ggc ggc ttc etc gac ggt 671 

'^J Asp Ala He Gly Lys Thr Phe Val Arg His Gly Gly Phe Leu Asp Gly 
P 210 215 220 

gcg ace ggc ttc gac geg gcg ttc ttc ggg ate age ccg cge gag gee 719 

Ala Thr Gly Phe Asp Ala Ala Phe Phe Gly He Ser Pro Arg Glu Ala 
225 230 235 

ctg gee atg gac ccg cag caa egg gtg cte ctg gag acg tec tgg gag 767 

Leu Ala Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser Trp Glu 
240 245 250 255 

gcg ttc gaa age gcg ggc ate ace ccg gac gcg geg egg ggc age gac 815 

Ala Phe Glu Ser Ala Gly He Thr Pro Asp Ala Ala Arg Gly Ser Asp 
260 265 270 

aee ggc gtg ttc ate ggc geg ttc tee tac ggg tac ggc acg ggt gcg 863 

Thr Gly Val Phe He Gly Ala Phe Ser Tyr Gly Tyr Gly Thr Gly Ala 

275 280 285 

gat ace aae ggc ttc ggc gcg aca ggg teg cag ace age gtg etc tec 911 

Asp Thr Asn Gly Phe Gly Ala Thr Gly Ser Gin Thr Ser Val Leu Ser 
290 295 300 
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ggc cgc etc teg tac tte tae ggt ctg gag ggc act teg gtc acg gtc 959 
Gly Arg Leu Ser Tyr Phe Tyr Gly Leu Glu Gly P^o Ser Val Thr Val 
305 310 315 

gac ace gee tge teg teg tea ctg gtc gee etg eac cag gca ggg cag 1007 
Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin 
320 325 330 335 

tec ctg cge teg ggc gaa tgc teg etc gee ctg gtc ggc ggt gtc acg 1055 
Ser Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr 
340 345 350 

gtg atg gcg teg ecc ggc gga ttc gtc gag tte tee egg cag cge ggg 1103 
Val iviet Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly 
355 360 365 

etc gcg ccg gae ggg egg gcg aag gcg tte ggc gcg ggc gcg gac ggt 1151 
Leu Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly 
370 375 380 

ac9 ttc gee gag ggc gee ggt gee ctg gtg gtc gag egg etc tec 1199 

Thr Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser 
385 390 395 

gae gcg gag cgc cae ggc cac ace gtc etc gee etc gta cgc ggc tec 124 7 
Asp Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser 
400 405 410 415 

gcg get aac tec gac ggc gcg teg aac ggt ctg teg gcg ccg aac ggc 1295 
Ala Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly 
420 425 430 

ecc tec cag gaa cge gtc ate eac cag gee etc gcg aac gcg aaa etc 1343 
Pro Ser Gin Glu Arg Val lie His Gin Ala Leu Ala Asn Ala Lys Leu 
435 440 445 

ace ecc gee gat gtc gae gcg gtc gag gcg cac ggc ace ggc acc cgc 13 91 
Thr Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg 
450 455 460 

etc ggc gac ecc ate gag gcg cag gcg ctg etc gcg acg tac gga cag 1439 
Leu Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin 
465 470 475 

gac egg gcg acg ecc ctg ctg etc ggc teg ctg aag teg aac ate ggg 1487 
Asp Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly 
480 485 490 495 

cac gee cag gee gcg tea ggg gtc gee ggg ate ate aag atg gtg cag 1535 
His Ala Gin Ala Ala Ser Gly Val Ala Gly lie lie Lys Met Val Gin 
500 505 510 

gcc ate egg eac ggg gaa ctg ccg ccg aca etg cac gcg gac gag ccg 1583 
Ala lie Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro 
515 520 525 
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teg ccg cac gtc gac tgg acg gcc ggt gcc gtc gag etc ctg acg teg 1631 
Ser Pro His Val Asp Tirp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser 
530 535 540 

gee egg ceg tgg ccg ggg aee ggt cgc cct agg egg gcg ggc gtg teg 1679 
Ala Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Gly Val Ser 
545 550 555 

tec tte gga gtc age ggc ace aac gee eae gtc ate ctg gag age gca 1727 
Ser Phe Gly Val Ser Gly Thr Asn Ala His Val lie Leu Glu Ser Ala 
560 565 570 575 

cee ece get eag eee gcg gag gag gcg cag cet gtt gag acg ceg gtg 1775 
Pro Pro Ala Gin Pro Ala Glu Glu Ala Gin Pro Val Glu Thr Pro Val 
580 585 590 

gtg gee teg gat gtg ctg ccg ctg gtg ata teg gcc aag ace eag ccc 1823 
Val Ala Ser Asp Val Leu Pro Leu Val lie Ser Ala Lys Thr Gin Pro 
595 600 605 

gcc ctg aee gaa cac gaa gac egg ctg cgc gcc tac ctg gcg gcg teg 1871 

Ala Leu Thr Glu His Glu Asp Arg Leu Arg Ala Tyr Leu Ala Ala Ser 
610 615 620 

f 

h3 ccc ggg gcg gat ata egg get gtg gca teg acg ctg gcg gtg aca egg 1919 

W Pro Gly Ala Asp lie Arg Ala Val Ala Ser Thr Leu Ala Val Thr Arg 
H 625 630 635 

3 . teg gtg tte gag cac cgc gcc gta etc ett gga gat gac aee gtc ace 1967 

Ser Val Phe Glu His Arg Ala Val Leu Leu Gly Asp Asp Thr Val Thr 

640 645 650 655 



ggc acc gcg gtg ace gac ccc agg ate gtg ttt gtc ttt ccc ggg cag 2015 
Gly Thr Ala Val Thr Asp Pro Arg lie Val Phe Val Phe Pro Gly Gin 
660 665 670 

999 tgg eag tgg ctg ggg atg ggc agt gca ctg cgc gat teg teg gtg 2063 
Gly Trp Gin Trp Leu Gly Met Gly Ser Ala Leu Arg Asp Ser Ser Val 
675 680 685 

gtg tte gee gag egg atg gee gag tgt gcg gcg gcg ttg cgc gag tte 2111 
Val Phe Ala Glu Arg Met Ala Glu Cys Ala Ala Ala Leu Arg Glu Phe 
690 695 700 

gtg gac tgg gat ctg tte acg gtt ctg gat gat ccg gcg gtg gtg gac 2159 
Val Asp Trp Asp Leu Phe Thr Val Leu Asp Asp Pro Ala Val Val Asp 
705 710 715 

egg gtt gat gtg gtc cag ccc get tec tgg gcg atg atg gtt tec ctg 2207 
Arg Val Asp Val Val Gin Pro Ala Ser Trp Ala Met Met Val Ser Leu 
720 725 730 735 

gcc gcg gtg tgg cag gcg gcc ggt gtg egg ccg gat gcg gtg ate ggc 2255 
Ala Ala Val Trp Gin Ala Ala Gly Val Arg Pro Asp Ala Val lie Gly 
740 745 750 
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cat teg cag ggt gag ate gee gea get tgt gtg geg ggt gcg gtg tea 2303 
His Ser Gin Gly Glu lie Ala Ala Ala Cys Val Ala Gly Ala Val Ser 
755 760 765 

eta cge gat gee gee egg ate gtg aec ttg cge age eag geg ate gee 2351 
Leu Arg Asp Ala Ala Arg He Val Thr Leu Arg Ser Gin Ala He Ala 
770 775 780 

egg gge etg geg gge egg gge gcg atg gea tee gtc gee etg cee geg 23 99 
Arg Gly Leu Ala Gly Arg Gly Ala Met Ala Ser Val Ala Leu Pro Ala 
785 790 795 

cag gat gtc gag etg gtc gae ggg gee tgg ate gee gee cae aac ggg -2447 
Gin Asp Val Glu Leu Val Asp Gly Ala Trp He Ala Ala His Asn Gly 
800 805 810 815 

cee gee tee ace gtg ate geg gge ace ceg gaa gcg gtc gae eat gtc 24 95 
Pro Ala Ser Thr Val He Ala Gly Thr Pro Glu Ala Val Asp His Val 
820 825 830 

etc ace get eat gag gea caa ggg gtg egg gtg egg egg ate aec gtc 254 3 
Leu Thr Ala His Glu Ala Gin Gly Val Arg Val Arg Arg He Thr Val 
835 840 845 



O gae tat gee teg cae ace ceg cae gtc gag etg ate cge gae gaa eta 2591 

y Asp Tyr Ala Ser His Thr Pro His Val Glu Leu He Arg Asp Glu Leu 
H 850 855 860 

m . 

2- etc gae ate act age gae age age teg cag aec ceg etc gtg ceg tgg 263 9 

Q Leu Asp He Thr Ser Asp Ser Ser Ser Gin Thr Pro Leu Val Pro Trp 
ffi 865 870 875 



etg teg aec gtg gae gge acc tgg gtc gae age ceg etg gae ggg gag 2687 
^ Leu Ser Thr Val Asp Gly Thr Trp Val Asp Ser Pro Leu Asp Gly Glu 
880 885 890 895 

tac tgg tac egg aac etg egt gaa ceg gtc ggt tte cae cee gee gtc 2735 
Tyr Trp Tyr Arg Asn Leu Arg Glu Pro Val Gly Phe His Pro Ala Val 
900 905 910 

age cag ttg cag gee cag gge gae ace gtg tte gtc gag gtc age gee 2783 
Ser Gin Leu Gin Ala Gin Gly Asp Thr Val Phe Val Glu Val Ser Ala 
915 920 925 

age ceg gtg ttg ttg eag geg atg gae gae gat gtc gtc acg gtt gee 2831 
Ser Pro Val Leu Leu Gin Ala Met Asp Asp Asp Val Val Thr Val Ala 
930 935 940 

acg etg egt egt gae gae gge gae gee acc egg atg etc acc gee etg 2879 
Thr Leu Arg Arg Asp Asp Gly Asp Ala Thr Arg Met Leu Thr Ala Leu 
945 950 955 

gea eag gee tat gtc cac gge gtc acc gtc gae tgg cee gee ate etc 2927 
Ala Gin Ala Tyr Val His Gly Val Thr Val Asp Trp Pro Ala He Leu 
960 965 970 975 
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ggc acc acc aca acc egg gta ctg gac ctt ccg acc tac gcc ttc caa 
Gly Thr Thr Thr Thr Arg Val Leu Asp Leu Pro Thr Tyr Ala Phe Gin 
, 980 985 990 

cac cag egg tac tgg etc gag teg get cec ccg gcc acg gcc gac teg 
His Gin Arg Tyr Trp Leu Glu Ser Ala Pro Pro Ala Thr Ala Asp Ser 
99S 1000 1005 

ggc cac cec gtc etc ggc acc gga gtc gcc gtc gcc ggg teg ccg ggc 
Gly His Pro Val Leu Gly Thr Gly Val Ala Val Ala Gly Ser Pro Gly 
1010 1015 1020 

egg gtg ttc acg ggt cec gtg cec gcc ggt gcg gac cgc gcg gtg ttc 
Arg Val Phe Thr Gly Pro Val Pro Ala Gly Ala Asp Arg Ala Val Phe 
1025 1030 1035 

ate gee gaa ctg gcg etc gee gee gcc gac gee acc gac tgc gcc acg 
lie Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Thr Asp Cys Ala Thr 
1040 1045 1050 1055 

gtc gaa cag etc gac gtc ace tec gtg cec ggc gga tec gee cgc ggc 
Val Glu Gin Leu Asp Val Thr Ser Val Pro Gly Gly Ser Ala Arg Gly 
1060 1065 1070 

agg gcc ace gcg cag acc tgg gtc gat gaa cec gee gcc gac ggg egg 
Arg Ala Thr Ala Gin Thr Trp Val Asp Glu Pro Ala Ala Asp Gly Arg 
1075 1080 1085 

cgc cgc ttc ace gtc cac ace cgc gtc ggc gac gee ccg tgg acg ctg 
Arg Arg Phe Thr Val His Thr Arg Val Gly Asp Ala Pro Trp Thr Leu 
1090 1095 1100 

cac gcc gag ggg gtt etc cgc cec ggc cgc gtg cec cag cec gaa gee 
His Ala Glu Gly Val Leu Arg Pro Gly Arg Val Pro Gin Pro Glu Ala 
1105 1110 1115 

gtc gac ace gcc tgg cec ccg ccg ggc gcg gtg cec gcg gac ggg ctg 
Val Asp Thr Ala Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly Leu 
1120 1125 1130 1135 

cec ggg gcg tgg ega cgc gcg gac cag gtc ttc gtc gaa gcc gaa gtc 
Pro Gly Ala Trp Arg Arg Ala Asp Gin Val Phe Val Glu Ala Glu Val 
1140 1145 1150 

gac age ect gac ggc ttc gtg gea cac ece gac ctg etc gac gcg gtc 
Asp Ser Pro Asp Gly Phe Val Ala His Pro Asp Leu Leu Asp Ala Val 
1155 1160 1165 

ttc tee gcg gtc ggc gac ggg age cgc cag ccg ace gga tgg cgc gac 
Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Thr Gly Trp Arg Asp 
1170 1175 1180 

etc gcg gtg cac gcg teg gac gee acc gtg ctg cgc gcc tgc etc acc 
Leu Ala Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu Thr 
1185 1190 1195 



# 
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cgc cgc gac agt ggt gtc gtg gag etc gcc gcc ttc gac ggt gcc gga 3647 
Arg Arg Asp Ser Gly Val Val Glu Leu Ala Ala Phe Asp Gly Ala Gly 
1200 1205 1210 1215 

atg ccg gtg etc acc gcg gag teg gtg acg ctg gge gag gtc gcg teg 3 695 
Met Pro Val Leu Thr Ala Glu Ser Val Thr Leu Gly Glu Val Ala Ser 
1220 1225 1230 

gca gge gga tec gae gag teg gac ggt ctg ctt egg ctt gag tgg ttg 3743 
Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu Arg Leu Glu Trp Leu 
1235 1240 1245 

ccg gtg gcg gag gcc cae tac gae ggt gee gae gag ctg ccc gag gge 3 791 
Pro Val Ala Glu Ala His Tyr Asp Gly Ala Asp Glu Leu Pro Glu Gly 
1250 1255 1260 

tac acc etc ate ace gcc aca cac cee gac gae ccc gac gae cec aec 3 83 9 
Tyr Thr Leu He Thr Ala Thr His Pro Asp Asp Pro Asp Asp Pro Thr 
1265 1270 1275 

3 3887 
^ Asn Pro His Asn Thr Pro Thr Arg Thr His Thr Gin Thr Thr Arg Val 
^ 1280 1285 1290 1295 

4=. 

W etc acc gee etc caa cae cac etc ate acc ace aac cac ace etc ate 3935 

W Leu Thr Ala Leu Gin His His Leu He Thr Thr Asn His Thr Leu He 

1300 1305 1310 

fft • " - 

^ gtc cac ace acc ace gac ccc eca gge gcc gcc gtc aec gge etc acc 3 983 

Q Val His Thr Thr Thr Asp Pro Pro Gly Ala Ala Val Thr Gly Leu Thr 
J 1315 1320 1325 

m 

cgc acc gca caa aac gaa cae cec gge cgc ate cac etc ate gaa aec 4 031 

^ Arg Thr Ala Gin Asn Glu His Pro Gly Arg lie His Leu He Glu Thr 
j-^ 1330 1335 1340 

cac cac cec cac aec eca etc cee etc acc caa etc acc aec etc cac 4079 
His His Pro His Thr Pro Leu Pro Leu Thr Gin Leu Thr Thr Leu His 
1345 1350 1355 

caa ccc cac eta cgc etc aec aac aac aec etc cae ace ccc cac etc 4127 
Gin Pro His Leu Arg Leu Thr Asn Asn Thr Leu His Thr Pro His Leu 
1360 1365 1370 1375 

ace ccc ate acc acc cac cae aac ace ace aca ace acc ccc aac ace 4175 
Thr Pro He Thr Thr His His Asn Thr Thr Thr Thr Thr Pro Asn Thr 
1380 1385 1390 

eca cec etc aac ccc aac cac gee ate etc ate acc gge gge tec gge 4223 
Pro Pro Leu Asn Pro Asn His Ala He Leu He Thr Gly Gly Ser Gly 
1395 1400 1405 

acc etc gee gge ate etc gee cgc eac etc aac cae ccc cac ace tac 4271 
Thr Leu Ala Gly He Leu Ala Arg His Leu Asn His Pro His Thr Tyr 
1410 1415 1420 
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•~ 1 



etc etc tec cgc aca cca cca ccc ccc acc aca cce ggc ace cac ate 4319 
Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr Pro Gly Thr His lie ■ 
1425 1430 1435 

ccc tgc gac etc acc gac cce ace caa ate acc caa gee etc acc cac 4367 
Pro Cys Asp Leu Thr Asp Pro Thr Gin He Thr Gin Ala Leu Thr His 
1440 1445 1450 1455 

ata cca caa cce etc ace ggc ate ttc cac acc gee gee acc etc gac 4415 
He Pro Gin Pro Leu Thr Gly He Phe His Thr Ala Ala Thr Leu Asp 
1460 1465 1470 

gac gee ace etc ace aac etc ace ccc caa cac etc acc acc acc etc 4463 
Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin His Leu Thr Thr Thr Leu 
1475 1480 1485 

caa ccc aaa gee gac gee gee tgg cac etc cac cac cac acc caa aac 4511 
Gin Pro Lys Ala Asp Ala Ala Trp His Leu His His His Thr Gin Asn 
1490 1495 1500 

caa cce etc ace cac ttc gtc etc tac tec age gee gee gee ace etc 4559 
Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala Ala Thr Leu 
1505 1510 1515 



^ ggc age ccc ggc caa gee aac tac gee gee gee aac gee ttc etc gac 4607 

Ly Gly Ser Pro Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp 

H 1520 1525 1530 1535 

m' ' . ■ . 

„ gee etc gee ace cac cgc cac ace caa gga caa ccc gee ace acc ate 4655 

Q Ala Leu Ala Thr His Arg His Thr Gin Gly Gin Pro Ala Thr Thr He 

2 1540 1545 1550 



gee tgg ggc atg tgg cac ace ace ace aca etc acc age caa etc acc 4703 
N Ala Trp Gly Met Trp His Thr Thr Thr Thr Leu Thr Ser Gin Leu Thr 
3 1555 1560 1565 

gac age gac cgc gac cgc ate cgc cgc ggc ggc ttc ctg ccg ate teg 4751 
Asp Ser Asp Arg Asp Arg He Arg Arg Gly Gly Phe Leu Pro He Ser 
1570 1575 1580 

gac gac gag ggc atg c 4 767 

Asp Asp Glu Gly Met 
1585 



<210> 29 
<211> 1588 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 29 

Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val Val 
15 10 15 
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Ala Ala Ala Leu 
20 

Arg Arg Thr Thr 
35 

Asp Arg Ser Pro 
50 

Arg Ser Ser Trp 
65 

Glu Asp lie Pro 



Leu Thr Ala Val 
100 

Arg Leu Asn Ala 
115 

Ala Ala Arg Leu 
130 

Ala Arg Thr Ala 
145 

Val Gly Met Ala 



Leu Trp Arg Leu 
180 

Ala Asp Arg Gly 
195 

Ala lie Gly Lys 
210 

Thr Gly Phe Asp 
225 

Ala Met Asp Pro 



Phe Glu Ser Ala 
260 

Gly Val Phe lie 
275 

Thr Asn Gly Phe 
290 

Arg Leu Ser Tyr 
305 



Asp Asp Ala Pro 



Val Arg Arg Ala 
40 

Cys Cys Pro Thr 
55 

Asn Ser Thr Ala 
70 

Ala Thr Thr Thr 
85 

Gin Leu Arg Asn 



Thr Ala Val Phe 
120 

Gly Asp Glu Leu 
135 

Ala Thr Ala Ala 
150 

Cys Arg Leu Pro 
165 

Val Ala Ser Gly 



Trp Asp Val Asp 
200 

Thr Phe Val Arg 
215 

Ala Ala Phe Phe 
230 

Gin Gin Arg Val 
245 

Gly lie Thr Pro 



Gly Ala Phe Ser 
280 

Gly Ala Thr Gly 
295 

Phe Tyr Gly Leu 
310 



Asp Val Pro Leu 
25 

Ala Val Arg Glu 



Thr Ser Ala Pro 
60 

Thr Val Leu Gly 
75 

Phe Lys Glu Leu 
90 

Ala Leu Thr Thr 
105 

Asp Phe Pro Thr 



Ala Gly Thr Arg 
140 

Ala His Asp Glu 
155 

Gly Gly Val Ala 
170 

Thr Asp Ala lie 
185 

Ala Leu Tyr Asp 



His Gly Gly Phe 
220 

Gly lie Ser Pro 
235 

Leu Leu Glu Thr 
250 

Asp Ala Ala Arg 
265 

Tyr Gly Tyr Gly 



Ser Gin Thr Ser 
300 

Glu Gly Pro Ser 
315 



Leu Arg Gly Leu 
30 

Arg Ser Leu Ala 
45 

Thr Pro Pro Ser 



His Leu Gly Ala 
80 

Gly lie Asp Ser 
95 

Ala Thr Gly Val 
110 

Pro Arg Ala Leu 
125 

Ala Pro Val Ala 



Pro Leu Ala lie 
160 

Ser Pro Gin Glu 
175 

Thr Glu Phe Pro 
190 

Pro Asp Pro Asp 
205 

Leu Asp Gly Ala 



Arg Glu Ala Leu 
240 

Ser Trp Glu Ala 
255 

Gly Ser Asp Thr 
270 

Thr Gly Ala Asp 
285 

Val Leu Ser Gly 



Val Thr Val Asp 
320 



110 

Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin Ser 
325 330 335 

Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr Val 
340 345 350 

Met Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly Leu 
355 360 365 

Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly Thr 
370 375 380 

Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser Asp 
385 390 395 400 

Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser Ala 
405 410 415 

Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro As n Gly Pro 
420 425 430 

Q 

^ Ser Gin Glu Arg Val lie His Gin Ala Leu Ala Asn Ala Lys Leu Thr 
435 440 445 

^ Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu 
y 450 455 460 

^ Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Asp 
01 465 470 475 480 

s • 

Q Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly His 
^ 485 490 495 

H ^ 

Ala Gin Ala Ala Ser Gly Val Ala Gly lie lie Lys Met Val Gin Ala 
^ 500 505 510 

^ He Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro Ser 
515 520 525 

Pro His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser Ala 
530 535 540 

Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ser 
545 550 555 560 

Phe Gly Val Ser Gly Thr Asn Ala His Val He Leu Glu Ser Ala Pro 
565 570 575 

Pro Ala Gin Pro Ala Glu Glu Ala Gin Pro Val Glu Thr Pro Val Val 
580 585 590 



Ala Ser Asp Val 
595 

Leu Thr Glu His 
610 



Leu Pro Leu Val 
600 

Glu Asp Arg Leu 
615 



He Ser Ala Lys 

Arg Ala Tyr Leu 
620 



Thr Gin Pro Ala 
605 

Ala Ala Ser Pro 



Ill 



Gly Ala Asp lie Arg Ala Val Ala Ser Thr Leu Ala Val Thr Arg Ser 
625 630 635 640 

Val Phe Glu His Arg Ala Val Leu Leu Gly Asp Asp Thr Val Thr Gly 
645 650 655 

Thr Ala Val Thr Asp Pro Arg He Val Phe Val Phe Pro Gly Gin Gly 
660 665 670 

Trp Gin Trp Leu Gly Met Gly Ser Ala Leu Arg Asp Ser Ser Val Val 
.675 680 685 

Phe Ala Glu Arg Met Ala Glu Cys Ala Ala Ala Leu Arg Glu Phe Val 
690 695 700 

Asp Trp Asp Leu Phe Thr Val Leu Asp Asp Pro Ala Val Val Asp Arg 
705 710 715 720 

Val Asp Val Val Gin Pro Ala Ser Trp Ala Met Met Val Ser Leu Ala 
725 730 735 



Ala Val Trp Gin Ala Ala Gly Val Arg Pro Asp Ala Val He Gly His 
f 740 745 750 

sis:* 

Q Ser Gin Gly Glu He Ala Ala Ala Cys Val Ala Gly Ala Val Ser Leu 
W 755 760 765 

ffl Arg Asp Ala Ala Arg He Val Thr Leu Arg Ser Gin Ala He Ala Arg 

s ' ■ - 770 775 . 780 
Pi 

^ Gly Leu Ala Gly Arg Gly Ala Met Ala Ser Val Ala Leu Pro Ala Gin 

J 785 790 795 800 

H Asp Val Glu Leu Val Asp Gly Ala Trp He Ala Ala His Asn Gly Pro 
P 805 810 815 

Ala Ser Thr Val He Ala Gly Thr Pro Glu Ala Val Asp His Val Leu 
820 825 830 

Thr Ala His Glu Ala Gin Gly Val Arg Val Arg Arg He Thr Val Asp 
835 840 845 

Tyr Ala Ser His Thr Pro His Val Glu Leu He Arg Asp Glu Leu Leu 
850 855 860 

Asp He Thr Ser Asp Ser Ser Ser Gin Thr Pro Leu Val Pro Trp Leu 
865 870 875 880 

Ser Thr Val Asp Gly Thr Trp Val Asp Ser Pro Leu Asp Gly Glu Tyr 
885 890 895 

Trp Tyr Arg Asn Leu Arg Glu Pro Val Gly Phe His Pro Ala Val Ser 
900 905 910 

Gin Leu Gin Ala Gin Gly Asp Thr Val Phe Val Glu Val Ser Ala Ser 
915 920 925 
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Pro Val Leu Leu Gin Ala Met Asp Asp Asp Val Val Thr Val Ala Thr 
930 935 940 

Leu Arg Arg Asp Asp Gly Asp Ala Thr Arg Met Leu Thr Ala Leu Ala 
945 950 955 960 

Gin Ala Tyr Val His Gly Val Thr Val Asp Trp Pro Ala lie Leu Gly 
965 970 975 

Thr Thr Thr Thr Arg Val Leu Asp Leu Pro Thr Tyr Ala Phe Gin His 
980 985 990 

Gin Arg Tyr Trp Leu Glu Ser Ala Pro Pro Ala Thr Ala Asp Ser Gly 
995 1000 1005 

His Pro Val Leu Gly Thr Gly Val Ala Val Ala Gly Ser Pro Gly Arg 
1010 1015 1020 

Val Phe Thr Gly Pro Val Pro Ala Gly Ala Asp Arg Ala Val Phe lie 
025 1030 1035 104 

Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Thr Asp Cys Ala Thr Val 
1045 1050 1055 

Glu Gin Leu Asp Val Thr Ser Val Pro Gly Gly Ser Ala Arg Gly Arg 
1060 1065 1070 

Ala Thr Ala Gin Thr Trp Val Asp Glu Pro Ala Ala Asp Gly' Arg Arg 
1075 1080 1085 ' 

Arg Phe Thr Val His Thr Arg Val Gly Asp Ala Pro Trp Thr Leu His 
1090 1095 1100 

Ala Glu Gly Val Leu Arg Pro Gly Arg Val Pro Gin Pro Glu Ala Val 
105 1110 1115 112 

Asp Thr Ala Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly Leu Pro 
1125 1130 1135 

Gly Ala Trp Arg Arg Ala Asp Gin Val Phe Val Glu Ala Glu Val Asp 
1140 1145 1150 

Ser Pro Asp Gly Phe Val Ala His Pro Asp Leu Leu Asp Ala Val Phe 
1155 1160 1165 

Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Thr Gly Trp Arg Asp Leu 
1170 1175 1180 

Ala Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu Thr Arg 
185 1190 1195 120 

Arg Asp Ser Gly Val Val Glu Leu Ala Ala Phe Asp Gly Ala Gly Met 
1205 1210 1215- 

Pro Val Leu Thr Ala Glu Ser Val Thr Leu Gly Glu Val Ala Ser Ala 
1220 1225 1230 
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Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu Arg Leu Glu Trp Leu Pro 
1235 1240 1245 

Val Ala Glu Ala His Tyr Asp Gly Ala Asp Glu Leu Pro Glu Gly Tyr 
1250 1255 1260 

Thr Leu He Thr Ala Thr His Pro Asp Asp Pro Asp Asp Pro Thr Asn 
265 1270 1275 128 

Pro His Asn Thr Pro Thr Arg Thr His Thr Gin Thr Thr Arg Val Leu 
1285 1290 1295 

Thr Ala Leu Gin His His Leu He Thr Thr Asn His Thr Leu He Val 
1300 1305 1310 

His Thr Thr Thr Asp Pro Pro Gly Ala Ala Val Thr Gly Leu Thr Arg 
1315 1320 1325 

Thr Ala Gin Asn Glu His Pro Gly Arg He His Leu He Glu Thr His 
1330 1335 1340 

His Pro His Thr Pro Leu Pro Leu Thr Gin Leu Thr Thr Leu His Gin 
345 1350 1355 136 

Pro His Leu Arg Leu Thr Asn Asn Thr Leu His Thr Pro His Leu Thr 
1365 1370 1375 

Pro He Thr Thr His His Asn Thr Thr Thr Thr Thr Pro Asn Thr Pro 
1380 1385 1390 

Pro Leu Asn Pro Asn His Ala He Leu He Thr Gly Gly Ser Gly Thr 
1395 1400 1405 

Leu Ala Gly He Leu Ala Arg His Leu Asn His Pro His Thr Tyr Leu 
1410 1415 1420 

Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr Pro Gly Thr His He Pro 
425 1430 1435 144 

Cys Asp Leu Thr Asp Pro Thr Gin He Thr Gin Ala Leu Thr His He 
1445 1450 1455 

Pro Gin Pro Leu Thr Gly He Phe His Thr Ala Ala Thr Leu Asp Asp 
1460 1465 1470 

Ala Thr Leu Thr Asn Leu Thr Pro Gin His Leu Thr Thr Thr Leu Gin 
1475 1480 1485 

Pro Lys Ala Asp Ala Ala Trp His Leu His His His Thr Gin Asn Gin 
1490 1495 1500 

Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala Ala Thr Leu Gly 
505 1510 1515 152 

Ser Pro Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp Ala 
1525 1530 1535 
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Leu Ala Thr His Arg His Thr Gin Gly Gin Pro Ala Thr Thr He Ala 
1540 1545 1550 

Trp Gly Met Trp His Thr Thr Thr Thr Leu Thr Ser Gin Leu Thr Asp 
1555 1560 1565 

Ser Asp Arg Asp Arg He Arg Arg Gly Gly Phe Leu Pro He Ser Asp 
1570 1575 1580 

Asp Glu Gly Met 
585 

<210> 30 
<211> 4737 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
Q PKS synthase fragment 

^ <220> 
3^ <221> CDS 
5 <222> (3) . . (4736) 

W <400> 30 

N gc atg egg ctg tac gag gcg gca egg cgc acc gga agt ccc gtg gta 47 
01 Met Arg Leii Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val 
L ^ 5 10 15 

^3 

S gtg gcg gcc gcg etc gac gac gcg ccg gac gtg ccg ctg ctg cgc ggg 95 
PJ Ala Leu Asp Asp Ala Pro Asp Val Pro Leu Leu Arg Gly 

<j 20 25 30 

ctg egg cgt aeg acc gtc egg cgt gee gee gtc egg gaa cgc tct etc 
r Leu Arg Arg Thr Thr Val Arg Arg Ala Ala Val Arg Glu Arg Ser Leu 
35 40 45 

gee gac cgc teg ccg tgc tgc ccg aeg aeg age gcg ccg acg cct ccc 
Ala Asp Arg Ser Pro Cys Cys Pro Thr Thr Ser Ala Pro Thr Pro Pro 
50 55 60 



143 
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teg cgt teg tec tgg aac age ace gcc acc gtg etc ggc eac ctg ggc 23 9 
Ser Arg Ser Ser Trp Asn Ser Thr Ala Thr Val Leu Gly His Leu Gly 
65 70 75 

gcc gaa gac ate ccg gcg acg aeg acg tte aag gaa etc ggc ate gac 287 
Ala Glu Asp He Pro Ala Thr Thr Thr Phe Lys Glu Leu Gly He Asp 
80 85 90 95 

teg etc ace gcg gtc cag ctg cgc aac gcg ctg acc acg gcg acc ggc 335 
Ser Leu Thr Ala Val Gin Leu Arg Asn Ala Leu Thr Thr Ala Thr Gly 
100 105 110 

gta cgc etc aac gcc aca gcg gtc tte gac ttt ccg acg ccg cgc gcg 383 
Val Arg Leu Asn Ala Thr Ala Val Phe Asp Phe Pro Thr Pro^ Arg Ala 
115 120 125 
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etc gcc gcg aga etc gge gac gag ctg gee ggt ace cge gcg ccc gtc 431 
Leu Ala Ala Arg Leu Gly Asp Glu Leu Ala Gly Thr Arg Ala Pro Val 
130 135 140 

gcg gcc egg acc gcg gee ace gcg gee gcg cac gac gaa eeg ctg gcg 4 79 
Ala Ala Arg Thr Ala Ala Thr Ala Ala Ala His Asp Glu Pro Leu Ala 
145 150 155 

ate gtg gge atg gee tgc egt ctg eeg gge ggg gtc gcg teg cea cag 52 7 
lie Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Ala Ser Pro Gin 
160 165 170 175 

gag ctg tgg cgt etc gtc gcg tec gge ace gac gee ate acg gag ttc 575 
Glu Leu Trp Arg Leu Val Ala Ser Gly Thr Asp Ala lie Thr Glu Phe 
180 185 190 • 

ccc gcg gac cge gge tgg gae gtg gac gcg etc tac gac eeg gac cec 623 
Pro Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro 
195 200 205 

gac gcg ate gge aag acc ttc gtc egg cac gge gge ttc etc gac ggt 671 
^fl Asp Gly Lys Thr Phe Val Arg His Gly Gly Phe Leu Asp Gly 

^ 210 215 220 

^ gcg ace gge ttc gac gcg gcg ttc ttc ggg ate age eeg cge gag gee 719 
^ Ala Thr Gly Phe Asp Ala Ala Phe Phe Gly lie Ser Pro Arg Glu Ala 
225 230 235 

81 . 

s ctg gee atg gac eeg cag caa egg gtg etc ctg gag acg tec tgg gag 767 
Q Leu Ala Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser Trp Glu 
m 240 245 250 255 

m 

gcg ttc gaa age gcg gge ate acc eeg gac gcg gcg egg gge age gae 815 

Ala Phe Glu Ser Ala Gly lie Thr Pro Asp Ala Ala Arg Gly Ser Asp 
260 265 270 

acc gge gtg ttc ate gge gcg ttc tee tac ggg tac gge acg ggt gcg 863 
Thr Gly Val Phe lie Gly Ala Phe Ser Tyr Gly Tyr Gly Thr Gly Ala 
275 280 285 

gat ace aac gge ttc gge gcg aca ggg teg cag ace age gtg etc tec 911 
Asp Thr Asn Gly Phe Gly Ala Thr Gly Ser Gin Thr Ser Val Leu Ser 
290 295 300 

gge cge etc teg tac ttc tac ggt ctg gag gge ect teg gtc acg gtc 959 
Gly Arg Leu Ser Tyr Phe Tyr Gly Leu Glu Gly Pro Ser Val Thr Val 
305 310 315 

gac acc gee tgc teg teg tea ctg gtc gcc ctg cac cag gca ggg cag 1007 
Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin 
320 325 330 335 

tec ctg cge teg gge gaa tgc teg etc gcc ctg gtc gge ggt gtc acg 1055 
Ser Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr 
340 345 350 
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gtg atg gcg teg ccc ggc gga ttc gtc gag ttc tec egg eag egc ggg 1103 
Val Met Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly 
355 360 365 

etc gcg ccg gac ggg egg gcg aag gcg ttc ggc gcg ggc gcg gac ggt 1151 
Leu Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly 
370 375 380 

acg age ttc gee gag ggc gee ggt gee ctg gtg gtc gag egg etc tec 1199 
Thr Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser 
385 390 395 

gac gcg gag egc cac ggc eac acc gtc etc gee etc gta egc ggc tec 1247 
Asp Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser 
400 405 410 415 

gcg get aac tec gac ggc gcg teg aac ggt ctg teg gcg ccg aac ggc 1295 
Ala Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly 
420 425 430 

o 

9^9 etc 1343 

Pro Ser Gin Glu Arg Val lie His Gin Ala Leu Ala Asn Ala Lys Leu 
435 440 445 

ace ccc gee gat gtc gac gcg gtc gag gcg cac ggc acc ggc ace egc 13 91 
W Thr Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg 
H 450 455 460 

m ■ 

5_ etc ggc gac ccc ate gag gcg eag gcg ctg etc gcg acg tac gga eag 1.439 

p Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin 

^ 465 470 475 

l^i gac egg gcg acg ccc ctg ctg etc ggc teg ctg aag teg aac ate ggg 1487 

J Asp Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn He Gly 

1*^ 480 485 490 495 

cac gee eag gee gcg tea ggg gtc gee ggg ate ate aag atg gtg cag 1535 
His Ala Gin Ala Ala Ser Gly Val Ala Gly He lie Lys Met Val Gin 
500 505 510 

gec^ ate egg cac ggg gaa ctg ccg ccg aca ctg cac gcg gac gag ccg 1583 
Ala He Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro 
515 520 525 

teg ccg cac. gtc gac tgg acg gee ggt gee gtc gag etc ctg acg teg 1631 
Ser Pro His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser 
530 535 540 

gee egg ccg tgg ccg ggg acc ggt egc ccg egc egc get gee gtc teg 1679 
Ala Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Ala Val Ser 
545 550 555 

teg ttc ggc gtg age ggc acg aac gee cac ate ate ett gag gca gga 1727 
Ser Phe Gly Val Ser Gly Thr Asn Ala His He He Leu Glu Ala Gly 
560 565 570 575 
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ccg gtc aaa acg gga ccg gtc gag gca gga gcg ate gag gca gga ccg 1775 
Pro Val Lys Thr Gly Pro Val Glu Ala Gly Ala lie Glu Ala Gly Pro 
580 585 590 

gtc gaa gta gga ccg gtc gag get gga ccg etc ccc gcg gcg ccg ccg 1823 
Val Glu Val Gly Pro Val Glu Ala Gly Pro Leu Pro Ala Ala Pro Pro 
595 . 600 605 

tea gca ccg gge gaa gae ett ccg ctg etc gtg teg gcg cgt tee ccg 18 71 
Ser Ala Pro Gly Glu Asp- Leu Pro Leu Leu Val Ser Ala Arg Ser Pro 
610 615 620 

gag gca etc gae gag eag ate ggg egc ctg cge gee tat etc gae acc 1919 
Glu Ala Leu Asp Glu Gin lie Gly Arg Leu Arg Ala Tyr Leu Asp Thr 
625 630 635 

gge ccg gge gtc gae egg gcg gee gtg gcg cag aca ctg gee egg cgt 1967 
Gly Pro Gly Val Asp Arg Ala Ala Val Ala Gin Thr Leu Ala Arg Arg 
640 645 650 655 

^ acg cac tte acc cac egg gee gta ctg etc ggg gae ace gtc ate gge 2015 
Thr His Phe Thr His Arg Ala Val Leu Leu Gly Asp Thr Val lie Gly 
660 665 670 



m 



get ccc ccc gcg gae cag gee gae gaa etc gtc ttc gtc tac tec ggt 2063 
Ala Pro Pro Ala Asp Gin Ala Asp Glu Leu Val Phe Val Tyr Ser Gly 
675 680 685 



w 

Si ■■ - . ■ 

s cag gge ace cag eat ccc gcg atg gge gag cag eta gee gee gcg tte 2111 
p Gin Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Ala Ala Phe 
fti 690 695 700 



eee gtc tte gcg egg ate eat cag cag gtg tgg gae ctg etc gat gtg 2159 
Pro Val Phe Ala Arg lie His Gin Gin Val Trp Asp Leu Leu Asp Val 
705 710 715 

ccc gat ctg gag gtg aac gag acc ggt tac gee cag ccg gee ctg ttc 2207 
Pro Asp Leu Glu Val Asn Glu Thr Gly Tyr Ala Gin Pro Ala Leu Phe 
720 725 730 735 

gca atg eag gtg get ctg tte ggg ctg ctg gaa teg tgg ggt gta ega 2255 
Ala Met Gin Val Ala Leu Phe Gly Leu Leu Glu Ser Trp Gly Val Arg 
740 745 750 

ccg gae gcg gtg ate gge cat teg gtg ggt gag ctt gcg get gcg tat 2303 
Pro Asp Ala Val lie Gly His Ser Val Gly Glu Leu Ala Ala Ala Tyr 
755 760 765 

gtg tec ggg gtg tgg teg ttg gag gat gee tgc act ttg gtg teg gcg 2351 
Val Ser Gly Val Trp Ser Leu Glu Asp Ala Cys Thr Leu Val Ser Ala 
770 775 780 

egg get cgt ctg atg eag get ctg ccc gcg ggt ggg gtg atg gtc get 23 99 
Arg Ala Arg Leu Met Glri Ala Leu Pro Ala Gly Gly Val Met Val Ala 
785 790 795 
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€1 



gtc ccg gtc teg gag gat gag gcc egg gcc gtg ctg ggt gag ggt gtg 244 7 
Val Pro Val Ser Glu Asp Glu Ala Arg Ala Val Leu Gly Glu Gly Val 
800 805 810 815 

gag ate gcc geg gtc aac gge ceg teg teg gtg gtt etc tec ggt gat .2495 
Glu He Ala Ala Val Asn Gly Pro Ser Ser Val Val Leu Ser Gly Asp 
820 825 830 

gag gee gee gtg ctg cag gcc geg gag ggg ctg ggg aag tgg acg egg 2543 
Glu Ala Ala Val Leu Gin Ala Ala Glu Gly Leu Gly Lys Trp Thr Arg 
835 840 845 

ctg geg acc age cae geg tte cat tec gee cgt atg gaa cec atg ctg 2591 
Leu Ala Thr Ser His Ala Phe His Ser Ala Arg Met Glu Pro Met Leu 
850 855 860 

gag gag tte egg geg gtc gcc gaa gge ctg acc tac egg acg ceg cag 2 63 9 
Glu Glu Phe Arg Ala Val Ala Glu Gly Leu Thr Tyr Arg Thr Pro Gin 
865 870 875 

gtc tec atg gee gtt ggt gat cag gtg acc acc get gag tac tgg gtg 2687 
Val Ser Met Ala Val Gly Asp Gin Val Thr Thr Ala Glu Tyr Trp Val 
880 885 890 895 



p egg cag gtc egg gae acg gtc egg tte gge gag cag gtg gee teg tac 2735 
UJ Arg Gin Val Arg Asp Thr Val Arg Phe Gly Glu Gin Val Ala Ser Tyr 
H 900 905 910 

01 

2 gag gae gee gtg tte gtc gag ctg ggt gcc gae egg. tea ctg gcc cge 2783 

Glu Asp Ala Val Phe Val Glu Leu Gly Ala Asp Arg Ser Leu Ala Arg 

g 915 920 925 

ctg gtc gae ggt gtc geg atg ctg cae gge gae cae gaa ate cag gcc 2 831 
^ Leu Val Asp Gly Val Ala Met Leu His Gly Asp His Glu lie Gin Ala 
930 935 940 

geg ate gge gcc ctg gee cae ctg tat gtc aac gge gtc acg gtc gae 2879 
Ala He Gly Ala Leu Ala His Leu Tyr Val Asn Gly Val Thr Val Asp 
945 950 955 

tgg cec geg etc ctg gge gat get ccg gca aca egg gtg ctg gae ett 2927 
Trp Pro Ala Leu Leu Gly Asp Ala Pro Ala Thr Arg Val Leu Asp Leu 
960 965 970 975 

ccg aca tac gcc tte cag cae cag egc tac tgg etc gag teg get cec 2 975 
Pro Thr Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu Glu Ser Ala Pro 
980 985 990 

ceg gcc acg gcc gae teg gge cae cec gtc etc gge acc gga gtc gcc 3023 
Pro Ala Thr Ala Asp Ser Gly His Pro Val Leu Gly Thr Gly Val Ala 
995 1000 1005 

gtc gcc ggg teg ccg gge egg gtg tte acg ggt cec gtg cec gcc ggt 3071 
Val Ala Gly Ser Pro Gly Arg Val Phe Thr Gly Pro Val Pro Ala Gly 
1010 1015 1020 
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gcg gac cgc gcg gtg ttc ate gcc gaa ctg gcg etc gcc gcc gcc gac 3119 
Ala Asp Arg Ala Val Phe lie Ala Glu Leu Ala Leu Ala Ala Ala Asp 
1025 1030 1035 

gcc acc gac tgc gcc acg gtc gaa cag etc gac gtc acc tec gtg cec 3167 
Ala Thr Asp Cys Ala Thr Val Glu Gin Leu Asp Val Thr Ser Val Pro 
1040 1045 1050 1055 

ggc gga tec gee cgc ggc agg gee acc gcg cag acc tgg gtc gat gaa 3215 
Gly Gly Ser Ala Arg Gly Arg Ala Thr Ala Gin Thr Trp Val Asp Glu 
1060 1065 1070 

cec gcc gee gac ggg egg cgc cgc ttc acc gtc cac acc cgc gtc ggc 3 2 63 
Pro Ala Ala Asp Gly Arg Arg Arg Phe Thr Val His Thr Arg Val Gly 
1075 1080 1085 

gac gcc ceg tgg acg ctg cac gcc gag ggg gtt etc cgc cec ggc cgc 3311 
Asp Ala Pro Trp Thr Leu His Ala Glu Gly Val Leu Arg Pro Gly Arg 
1090 1095 lido 

gtg cec cag cec gaa gcc gtc gac ace gcc tgg cec ceg ccg ggc gcg 3 359 
Val Pro Gin Pro Glu Ala Val Asp Thr Ala Trp Pro Pro Pro Gly Ala 
1105 1110 1115 



Q gtg cec gcg gac ggg ctg cec ggg gcg tgg cga cgc gcg gac cag gtc 3407 

UJ Val Pro Ala Asp Gly Leu Pro Gly Ala Trp Arg Arg Ala Asp Gin Val 
M. 1120 1125 1130 1135 

yi- . ■ - 

g ttc gtc gaa gcc gaa gtc gac age ect gac ggc ttc gtg gea cac cec 3455 

Phe Val Glu Ala Glu Val Asp Ser Pro Asp Gly Phe Val Ala . His Pro 
g 1140 1145 1150 

m 

[-^ gac ctg etc gac gcg gtc ttc tec gcg gtc ggc gac ggg age cgc cag 3503 

^1 Asp Leu Leu Asp Ala Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin 
O 1155 1160 1165 

ccg acc gga tgg cgc gac etc gcg gtg cac gcg teg gac gcc acc gtg 3551 

Pro Thr Gly Trp Arg Asp Leu Ala Val His Ala Ser Asp Ala Thr Val 
1170 1175 1180 

ctg cgc gcc tgc etc acc cgc cgc gac agt ggt gtc gtg gag etc gcc 3599 

Leu Arg Ala Cys Leu Thr Arg Arg Asp Ser Gly Val Val Glu Leu Ala 
1185 1190 1195 

gee ttc gac ggt gee gga atg ccg gtg etc ace gcg gag teg gtg acg 3647 

Ala Phe Asp Gly Ala Gly Met Pro Val Leu Thr Ala Glu Ser Val Thr 
1200 1205 1210 1215 

ctg ggc gag gtc gcg teg gea ggc gga tec gac gag teg gac ggt ctg 3695 

Leu Gly Glu Val Ala Ser Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu 
1220 1225 1230 

ett egg ctt gag tgg ttg ccg gtg gcg gag gee cac tac gac ggt gcc 3743 

Leu Arg Leu Glu Trp Leu Pro Val Ala Glu Ala His Tyr Asp Gly Ala 
1235 1240 1245 
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gac gag ctg ccc gag ggc tac acc etc ate acc gcc aca cac ccc gac 3791 
Asp Glu Leu Pro Glu Gly Tyr Thr Leu lie Thr Ala Thr His Pro Asp 
1250 1255 1260 

gac ccc gac gac ccc acc aac ccc cac aac aca ccc aca cgc acc cac 383 9 
Asp Pro Asp Asp Pro Thr. Asn Pro His Asn Thr Pro Thr Arg Thr His 
1265 1270 1275 

aca caa acc aca cgc gtc etc acc gcc etc caa cac cac etc ate acc 3 887 
Thr Gin Thr Thr Arg Val Leu Thr Ala Leu Gin His His Leu lie Thr 
1280 1285 1290 1295 

acc aac cac ace etc ate gtc cac acc ace acc gac ccc cca ggc gee 3935 
Thr Asn His Thr Leu lie Val His Thr Thr Thr Asp Pro Pro Gly Ala 
1300 1305 1310 

gcc gtc ace ggc etc ace cgc ace gca caa aac gaa cac ccc ggc cgc 3 983 
Ala Val Thr Gly Leu Thr Arg Thr Ala Gin Asn Glu His Pro Gly Arg 
1315 1320 1325 

lie His Leu lie Glu Thr His His Pro His Thr Pro Leu Pro Leu Thr 
''^ 1330 1335 1340 

C3 caa etc acc acc etc cac caa ccc cac eta cgc etc acc aac aac acc 4079 
yj Gin Leu Thr Thr Leu His Gin Pro His Leu Arg Leu Thr Asn Asn Thr 
1345 1350 1355 

etc cac ace ccc cac etc acc ccc ate acc acc cac cac aac acc ace 4127 
Leu His Thr Pro His Leu Thr Pro lie Thr Thr His His Asn Thr Thr 
1360 1365 1370 1375 



Ij^ aca ace ace ccc aac ace cca ccc etc aac ccc aac cac gee ate etc 4175 

''^ Thr Thr Thr Pro Asn Thr Pro Pro Leu Asn Pro Asn His Ala lie Leu 

P 1380 1385 1390 

ate ace ggc ggc tec ggc ace etc gcc ggc ate etc gcc cgc cac etc 4223 

lie Thr Gly Gly Ser Gly Thr Leu Ala Gly lie Leu Ala Arg His Leu 

1395 1400 1405 

aac cac ccc cac ace tac etc etc tec cgc aca cca cca ccc ccc ace 4271 

Asn His Pro His Thr Tyr Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr 

1410 1415 1420 

aca ccc ggc ace cac ate ccc tge gac etc ace gac ccc ace caa ate 4319 

Thr Pro Gly Thr His lie Pro Cys Asp Leu Thr Asp Pro Thr Gin lie 
1425 1430 1435 

ace caa gcc etc acc cac ata cca caa ccc etc ace ggc ate tte cac 4367 

Thr Gin Ala Leu Thr His lie Pro Gin Pro Leu Thr Gly lie Phe His 

1440 1445 1450 1455 

acc gee gee acc etc gac gac gcc ace etc acc aac etc acc ccc caa 4415 

Thr Ala Ala Thr Leu Asp Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin 

1460 1465 1470 
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cac etc acc acc acc etc caa cec aaa gcc gac gcc gee tgg cac etc 4463 
His Leu Thr Thr Thr Leu Gin Pro Lys Ala Asp Ala Ala Trp His Leu 
1475 1480 1485 

eac cac cac acc caa aac caa ccc etc acc cac ttc gtc etc tac tec 4511 
His His His Thr Gin Asn Gin Pro Leu Thr His Phe Val Leu Tyr Ser 
1490 1495 1500 

age gcc gee gee ace etc ggc age ccc ggc caa gcc aac tac gcc gee 4559 
Ser Ala Ala Ala Thr Leu Gly Ser Pro Gly Gin Ala Asn Tyr Ala Ala 
1505 1510 1515 

gee aac gcc ttc etc gac gee etc gee ace cac cgc cac ace caa gga 4607 
Ala Asn Ala Phe Leu Asp Ala Leu Ala Thr His Arg His Thr Gin Gly 
1520 1525 1530 1535 



caa ccc gcc ace acc ate gcc tgg ggc atg tgg cac ace ace acc aca 4655 
Gin Pro Ala Thr Thr lie Ala Trp Gly Met Trp His Thr Thr Thr Thr 
1540 1545 1550 

etc ace age caa etc ace gac age gac cgc gac cgc ate cgc cgc ggc 4703 
Leu Thr Ser Gin Leu Thr Asp Ser Asp Arg Asp Arg lie Arg Arg Gly 
1555 1560 1565 

ggc ttc etg ecg ate teg gac gac gag ggc atg e 4737 
Gly Phe Leu Pro lie Ser Asp Asp Glu Gly Met 
1570 1575 



<210> 31 
<211> 1578 
<212> PRT 

<213> Artificial Sequence 
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Q <220> 

Lsi, <223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 31 

Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val Val 
15 10 15 

Ala Ala Ala Leu Asp Asp Ala Pro Asp Val Pro Leu Leu Arg Gly Leu 
20 25 30 

Arg Arg Thr Thr Val Arg Arg Ala Ala Val Arg Glu Arg Ser Leu Ala 
35 40 " 45 

Asp Arg Ser Pro Cys Cys Pro Thr Thr Ser Ala Pro Thr Pro Pro Ser 
50 55 60 

Arg Ser Ser Trp Asn Ser Thr Ala Thr Val Leu Gly His Leu Gly Ala 
65 70 75 80 



Glu Asp lie 



Pro Ala Thr Thr Thr Phe Lys Glu Leu Gly lie Asp Ser 
85 90 95 
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Leu Thr Ala Val Gin Leu Arg Asn Ala Leu Thr Thr Ala Thr Gly Val 
100 105 110 

Arg Leu Asn Ala Thr Ala Val Phe Asp Phe Pro Thr Pro Arg Ala Leu 
115 120 125 

Ala Ala Arg Leu Gly Asp Glu Leu Ala Gly Thr Arg Ala Pro Val Ala 
130 135 140 

Ala Arg Thr Ala Ala Thr Ala Ala Ala His Asp Glu Pro Leu Ala lie 
145 150 ' 155 160 

Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Ala Ser Pro Gin Glu 
165 170 175 

Leu Trp Arg Leu Val Ala Ser Gly Thr Asp Ala lie Thr Glu Phe Pro 
180 185 190 

Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro Asp 
195 200 205 

^ Ala lie Gly Lys Thr Phe Val Arg His Gly Gly Phe Leu Asp Gly Ala 
^ 210 215 220 

€J 

=P Thr Gly Phe Asp Ala Ala Phe Phe Gly lie Ser Pro Arg Glu Ala Leu 
Q 225 230 235 240 

w 

Ala Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser Trp Glu Ala 

gi 245 . 250 255 

Phe Glu Ser Ala Gly lie Thr Pro Asp Ala Ala Arg Gly Ser Asp Thr 
S 260 265 270 



Gly Val Phe lie Gly Ala Phe Ser Tyr Gly Tyr Gly Thr Gly Ala Asp 
275 280 285 

Thr Asn Gly Phe Gly Ala Thr Gly Ser Gin Thr Ser Val Leu Ser Gly 
290 295 300 

Arg Leu Ser Tyr Phe Tyr Gly Leu Glu Gly Pro Ser Val Thr Val Asp 
305 310 315 320 

Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin Ser 
325 330 335 

Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr Val 
340 345 350 

Met Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly Leu 
355 360 365 

Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly Thr 
370 375 380 



Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser Asp 
385 390 395 400 
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Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser Ala 
405 410 415 

Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly Pro 
420 425 430 

Ser Gin Glu Arg Val lie His Gin Ala Leu Ala Asn Ala Lys Leu Thr 
435 440 445 

Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu 
450 455 460 

Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Asp 
465 470 475 480 

Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly His 
485 490 495 

Ala Gin Ala Ala Ser Gly Val Ala Gly lie lie Lys Met Val Gin Ala 
500 505 510 

"•J- He Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro Ser 
^ 515 520 525 

O Pro His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser Ala 

Ul 530 535 ' 540 

gi Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Ala Val Ser Ser 

g 545 550 555 560 

S Phe Gly Val Ser Gly Thr Asn Ala His lie He Leu Glu Ala Gly Pro 
Si 565 570 575 



Val Lys Thr Gly Pro Val Glu Ala Gly Ala He Glu Ala Gly Pro Val 



fc3 580 585 590 

Glu Val Gly Pro Val Glu Ala Gly Pro Leu Pro Ala Ala Pro Pro Ser 
595 600 605 

Ala Pro Gly Glu Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro Glu 
610 615 620 

Ala Leu Asp Glu Gin He Gly Arg Leu Arg Ala Tyr Leu Asp Thr Gly 
625 630 635 640 

Pro Gly Val Asp Arg Ala Ala Val Ala Gin Thr Leu Ala Arg Arg Thr 
645 650 655 

His Phe Thr His Arg Ala Val Leu Leu Gly Asp Thr Val He Gly Ala 
660 665 670 

Pro Pro Ala Asp Gin Ala Asp Glu Leu Val Phe Val Tyr Ser Gly Gin 
675 680 685 



Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Ala Ala Phe Pro 
690 695 700 
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Val Phe Ala Arg lie His Gin Gin Val Trp Asp Leu Leu Asp Val Pro 
705 710 715 720 

Asp Leu Glu Val Asn Glu Thr Gly Tyr Ala Gin Pro Ala Leu Phe Ala 
725 730 735 

Met Gin Val Ala Leu Phe Gly Leu Leu Glu Ser Trp Gly Val Arg Pro 
740 745 ' 750 

Asp Ala Val He Gly His Ser Val Gly Glu Leu Ala Ala Ala Tyr Val 
755 760 765 

Ser Gly Val Trp Ser Leu Glu Asp Ala Cys Thr Leu Val Ser Ala Arg 
770 775 780 

Ala Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Val Met Val Ala Val 
785 790 795 800 

Pro Val Ser Glu Asp Glu Ala Arg Ala Val Leu Gly Glu Gly Val Glu 
805 810 815 

He Ala Ala Val Asn Gly Pro Ser Ser Val Val Leu Ser Gly Asp Glu 
820 825 . 830 

Ala Ala Val Leu Gin Ala Ala Glu Gly Leu Gly Lys Trp Thr Arg Leu 
835 840 845 

Ala Thr Ser His Ala Phe His Ser Ala Arg Met Glu Pro Met Leu Glu 
850 855 860 

Glu Phe Arg Ala Val Ala Glu Gly Leu Thr Tyr Arg Thr Pro Gin Val 
865 870 875 880 

Ser Met Ala Val Gly Asp Gin Val Thr Thr Ala Glu Tyr Trp Val Arg 
885 890 895 

Gin Val Arg Asp Thr Val Arg Phe Gly Glu Gin Val Ala Ser Tyr Glu 
900 905 910 

Asp Ala Val Phe Val Glu Leu Gly Ala Asp Arg Ser Leu Ala Arg Leu 
915 920 925 

Val Asp Gly Val Ala Met Leu His Gly Asp His Glu He Gin Ala Ala 
930 935 940 

He Gly Ala Leu Ala His Leu Tyr Val Asn Gly Val Thr Val Asp Trp 
945 950 955 960 

Pro Ala Leu Leu Gly Asp Ala Pro Ala Thr Arg Val Leu Asp Leu Pro 
965 970 975 

Thr Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu Glu Ser Ala Pro Pro 
980 985 990 

Ala Thr Ala Asp Ser Gly His Pro Val Leu Gly Thr Gly Val Ala Val 
995 1000 1005 
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Ala Gly Ser Pro Gly Arg Val Phe Thr Gly Pro Val Pro Ala Gly Ala 
1010 1015 1020 

Asp Arg Ala Val Phe lie Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala 
025 1030 1035 104 

Thr Asp Cys Ala Thr Val Glu Gin Leu Asp Val Thr Ser Val Pro Gly 
1045 1050 1055 

Gly Ser Ala Arg Gly Arg Ala Thr Ala Gin Thr Trp Val Asp Glu Pro 
1060 1065 1070 

Ala Ala Asp Gly Arg Arg Arg Phe Thr Val His Thr Arg Val Gly Asp 
1075 1080 1085 

Ala Pro Trp Thr Leu His Ala Glu Gly Val Leu Arg Pro Gly Arg Val 
1090 1095 1100 

Pro Gin Pro Glu Ala Val Asp Thr Ala Trp Pro Pro Pro Gly Ala Val 
Q 105 1110 1115 112 

Pro Ala Asp Gly Leu Pro Gly Ala Trp Arg Arg Ala Asp Gin Val Phe 
^ 1125 1130 1135 

Q Val Glu Ala Glu Val Asp Ser Pro Asp Gly Phe Val Ala His Pro Asp 
W 1140 1145 1150 

g*! Leu Leu Asp Ala Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro 
5 1155 1160 . 1165 

□ 

^ Thr Gly Trp Arg Asp Leu Ala Val His Ala Ser Asp Ala Thr Val Leu 

^ 1170 1175 1180 

^ Arg Ala Cys Leu Thr Arg Arg Asp Ser Gly Val Val Glu Leu Ala Ala 

O 185 1190 1195 120 

L=£t 

Phe Asp Gly Ala Gly Met Pro Val Leu Thr Ala Glu Ser Val Thr Leu 
1205 1210 1215 

Gly Glu Val Ala Ser Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu 
1220 1225 1230 

Arg Leu Glu Trp Leu Pro Val Ala Glu Ala His Tyr Asp Gly Ala Asp 
1235 1240 1245 

Glu Leu Pro Glu Gly Tyr Thr Leu lie Thr Ala Thr His Pro Asp Asp 
1250 1255 1260 

Pro Asp Asp Pro Thr Asn Pro His Asn Thr Pro Thr Arg Thr His Thr 
265 1270 1275 128 

Gin Thr Thr Arg Val Leu Thr Ala Leu Gin His His Leu lie Thr Thr 
1285 1290 1295 



Asn His Thr Leu lie Val His Thr Thr Thr Asp Pro Pro Gly Ala Ala 
1300 1305 1310 
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Val Thr Gly Leu Thr Arg Thr Ala Gin Asn Glu His Pro Gly Arg lie 
1315 1320 1325 

His Leu lie Glu Thr His His Pro His Thr Pro Leu Pro Leu Thr Gin 
1330 1335 1340 

Leu Thr Thr Leu His Gin Pro His Leu Arg Leu Thr Asn Asn Thr Leu 
345 1350 1355 136 

His Thr Pro His Leu Thr Pro lie Thr Thr His His Asn Thr Thr Thr 
1365 1370 1375 

Thr Thr Pro Asn Thr Pro Pro Leu Asn Pro Asn His Ala lie Leu lie 
1380 1385 1390 

Thr Gly Gly Ser Gly Thr Leu Ala Gly lie Leu Ala Arg His Leu Asn 
1395 1400 1405 

His Pro His Thr Tyr Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr 
1410 1415 1420 

Pro Gly Thr His lie Pro Cys Asp Leu Thr Asp Pro Thr Gin lie Thr 
425 1430 1435 144 

Gin Ala Leu Thr His lie Pro Gin Pro Leu Thr Gly lie Phe His Thr 
1445 1450 1455 

Ala Ala Thr Leu Asp Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin His 
1460 1465 1470 

Leu Thr Thr Thr Leu Gin Pro Lys Ala Asp Ala Ala Trp His Leu His 
1475 1480 1485 

His His Thr Gin Asn Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser 
1490 1495 1500 

Ala Ala Ala Thr Leu Gly Ser Pro Gly Gin Ala Asn Tyr Ala Ala Ala 
505 1510 . 1515 152 

Asn Ala Phe Leu Asp Ala Leu Ala Thr His Arg His Thr Gin Gly Gin 
1525 1530 1535 

Pro Ala Thr Thr lie Ala Trp Gly Met Trp His Thr Thr Thr Thr Leu 
1540 1545 1550 

Thr Ser Gin Leu Thr Asp Ser Asp Arg Asp Arg lie Arg Arg Gly Gly 
1555 1560 1565 

Phe Leu Pro lie Ser Asp Asp Glu Gly Met 
1570 1575 



<210> 32 
<211> 4818 
<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 

<221> CDS 

<222> (3) . . (4817) 

<400> 32 

gc atg egg ctg tac gag gcg gca egg cgc acc gga agt ccc gtg gtg 4 7 
Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val 







1 








5 










10 










15 






gtg 

Val 


gcg 
Ala 


gee 
Ala 


gcg 
Ala 


etc 
Leu 
20 


gae 
Asp 


gae 
Asp 


gcg 
Ala 


eeg 
Pro 


gae 
Asp 
2 5 


gtg 
Val 


eeg 
Pro 


ctg 
Leu 


ctg 
Leu 


cgc 
Arg 
30 


ggg 

Gly 


95 




ctg 
Leu 


egg 
Arg 


cgt 
Arg 


aeg 
Thr 
35 


acc 
Thr 


gtc 
Val 


egg 
Arg 


cgt 
Arg 


gcc 
Ala 
40 


gcc 
Ala 


gtc 
Val 


egg 
Arg 


gaa 
Glu 


cgc 
Arg 
45 


tet 
Ser 


etc 
Leu 


143 


j — i 


gcc 
Ala 


gae 
Asp 


cgc 
Arg 
50 


teg 
Ser 


eeg 
Pro 


tgc 
Cys 


tgc 
Cys 


eeg 
Pro 
55 


aeg 
Thr 


aeg 
Thr 


age 
Ser 


gcg 
Ala 


ccg 
Pro 
60 


aeg 
Thr 


ect 
Pro 


ccc 
Pro 


191 


Ml 


teg 
Ser 


cgt 
Arg 
65 


teg 
Ser 


tec 
Ser 


tgg 
Trp 


aac 
Asn 


age 
Ser 
70 


ace 
Thr 


gcc 
Ala 


acc 
Thr 


gtg 
Val 


etc 
Leu 
75 


gge 

Gly 


cac 
His 


ctg 
Leu 


gge 
Gly 


239 


ffl 
a 


gcc 
Ala 
8 0 


gaa 
Glu 


gae 
Asp 


ate 
He 


eeg 
Pro 


gcg 
Ala 
85 


aeg 
Thr 


aeg 
Thr 


aeg 
Thr 


ttc 
Phe 


aag 
Lys 
90 


gaa 
Glu 


etc 
Leu 


gge 

Gly 


ate 
He 


gae 
Asp 
95 


287 


ru 

''-■4 

o 


teg 
Ser 


etc 
Leu 


ace 
Thr 


gcg 

Ala 


gtc 
Val 
100 


cag 
Gin 


ctg 
Leu 


cgc ■ 
Arg 


aac 
Asn 


gcg 
Ala 
105 


ctg 
Leu 


acc 
Thr 


aeg 
Thr 


gcg 

Ala 


ace 
Thr 
110 


gge 

Gly 


335 




gta 
Val 


cgc 
Arg 


etc 
Leu 


aac 
Asn 
115 


gee 
Ala 


aca 
Thr 


gcg 
Ala 


gtc 
Val 


ttc 
Phe 
120 


gae 
Asp 


ttt 
Phe 


ccg 
Pro 


aeg 
Thr 


ccg 
Pro 
125 


cgc 
Arg 


gcg 
Ala 


383 




etc 
Leu 


gcc 
Ala 


gcg 

Ala 
130 


aga 
Arg 


etc 
Leu 


gge 
Gly 


gae 
Asp 


gag 
Glu 
135 


ctg 
Leu 


gcc 
Ala 


ggt 
Gly 


acc 
Thr 


cgc 
Arg 
140 


gcg 
Ala 


ccc 
Pro 


gtc 
Val 


431 




gcg 

Ala 


gcc 
Ala 
145 


egg 
Arg 


ace 
Thr 


gcg 
Ala 


gee 
Ala 


acc 
Thr 
150 


gcg 
Ala 


gcc 
Ala 


gcg 

Ala 


cac 
His 


gae 
Asp 
155 


gaa 
Glu 


eeg 
Pro 


ctg 
Leu 


gcg 
Ala 


479 




ate 
He 
160 


gtg 
Val 


gge 
Gly 


atg 
Met 


gee 
Ala 


tgc 
Cys 
165 


cgt 
Arg 


ctg 
Leu 


eeg 
Pro 


gge 
Gly 


ggg 

Gly 
170 


gtc 
Val 


gcg 
Ala 


teg 
Ser 


eca 
Pro 


cag 
Gin 
175 


527 




gag 
Glu 


ctg 
Leu 


tgg 
Trp 


cgt 
Arg 


etc 
Leu 


gtc 
Val 


gcg 
Ala 


tec 
Ser 


gge 

Gly 


acc 
Thr 


gae 
Asp 


gcc 
Ala 


ate 
He 


aeg 
Thr 


gag 
Glu 


ttc 
Phe 


575 



180 185 190 
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ccc gcg gac cgc ggc tgg gac gtg gac gcg etc tac gac ccg gac ccc 
Pro Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro 
195 200 205 

gac gcg ate ggc aag acc ttc gtc egg cac ggc ggc ttc etc gac ggt 
Asp Ala lie Gly Lys Thr Phe Val Arg His Gly Gly Phe Leu Asp Gly 
210 215 220 

gcg acc ggc ttc gac gcg gcg ttc ttc ggg ate age ccg cgc gag gcc 
Ala Thr Gly Phe Asp Ala Ala Phe Phe Gly lie Ser Pro Arg Glu Ala 
225 230 235 

ctg gcc atg gac ccg cag caa egg gtg etc ctg gag acg tec tgg gag 
Leu Ala Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser Trp Glu 
240 245 250 255 

gcg ttc gaa age gcg ggc ate ace ccg gac gcg gcg egg ggc age gac 
Ala Phe Glu Ser Ala Gly lie Thr Pro Asp Ala Ala Arg Gly Ser Asp 
260 265 270 

acc ggc gtg ttc ate ggc gcg ttc tec tac ggg tac ggc acg ggt gcg 
Thr Gly Val Phe lie Gly Ala Phe Ser Tyr Gly Tyr Gly Thr Gly Ala 
275 280 285 

gat acc aac ggc ttc ggc gcg aca ggg teg cag acc age gtg etc tec 
Asp Thr Asn Gly Phe Gly Ala Thr Gly Ser Gin Thr Ser Val Leu Ser 
290 295 300 

ggc cgc etc teg tac ttc tac ggt ctg gag ggc cct teg gtc acg gtc 
Gly Arg Leu Ser Tyr Phe Tyr Gly Leu Glu Gly Pro Ser Val Thr Val 
305 310 315 

gac acc gee tgc teg teg tea ctg gtc gee ctg cac cag gea ggg cag 
Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin 
320 325 330 335 

tec ctg cgc teg ggc gaa tgc teg etc gee ctg gtc ggc ggt gtc acg 
Ser Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr 
340 345 350 

gtg atg gcg teg ccc ggc gga ttc gtc gag ttc tec egg cag cgc ggg 
Val Met Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly 
355 360 365 

etc gcg ccg gac ggg egg gcg aag gcg ttc ggc gcg ggc gcg gac ggt 
Leu Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly 
370 375 380 

acg age ttc gee gag ggc gcc ggt gee ctg gtg gtc gag egg etc tec 
Thr Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser 
385 390 395 



gac gcg gag cgc cac ggc cac ace gtc etc gee etc gta cgc ggc tee 
Asp Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser 
400 405 410 415 
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gcg get aac tec gac ggc gcg teg aac ggt ctg teg gcg ceg aae gge 
Ala Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly 
420 425 430 

cce tee eag gaa egc gte ate eac cag gee ete geg aae geg aaa etc 
Pro Ser Gin Glu Arg Val lie His Gin Ala Leu Ala Asn Ala Lys Leu 
435 440 445 

aec eee gee gat gtc gac geg gte gag geg cac ggc acc gge ace cge 
Thr Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg 
450 455 460 

etc ggc gac ccc ate gag gcg cag geg ctg ete gcg acg tac gga cag 
Leu Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin 
465 470 475 

gac egg geg acg ccc ctg ctg etc gge teg ctg aag teg aae ate ggg 
Asp Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly 
480 485 490 495 

eac gee cag gee gcg tea ggg gtc gee ggg ate ate aag atg gtg cag 
His Ala Gin Ala Ala Ser Gly Val Ala Gly lie lie Lys Met Val Gin 
500 505 510 



W gee ate egg eac ggg gaa ctg ccg ceg aca ctg cac gcg gac gag ceg 

hi Ala lie Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro 

M, 515 520 525 

m ■ ■ ■ 

teg ceg cac gtc gac tgg acg gee ggt gee gtc. gag etc ctg acg teg 

f« Ser Pro His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser 

^ 530 535 540 

=~ gee egg ccg tgg ccg ggg acc ggt cgc ccg cge egc get gee gtc teg 

Ala Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Ala Val Ser 

545 550 555 



teg ttc ggc gtg age ggc acg aac gee cac ate ate ctt gag gca gga 
Ser Phe Gly Val Ser Gly Thr Asn Ala His lie lie Leu Glu Ala Gly 
560 565 570 575 

ccg gtc aaa acg gga ccg gtc gag gca gga geg ate gag gca gga ccg 
Pro Val Lys Thr Gly Pro Val Glu Ala Gly Ala lie Glu Ala Gly Pro 
580 585 590 

gte gaa gta gga ccg gte gag get gga ceg etc cce geg gcg ccg ceg 
Val Glu Val Gly Pro Val Glu Ala Gly Pro Leu Pro Ala Ala Pro Pro 
595 600 605 



tea gca ccg ggc gaa gac ctt ceg ctg etc gtg teg geg cgt tec ceg 
Ser Ala Pro Gly Glu Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro 
610 615 620 

gag gca etc gac gag eag ate ggg cge ctg cge gee tat etc gac ace 
Glu Ala Leu Asp Glu Gin lie Gly Arg Leu Arg Ala Tyr Leu Asp Thr 
625 630 635 



1295 



1343 



1391 



1439 



1487 



1535 



1583 



1631 



1679 



1727 



1775 



1823 



1871 



1919 
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ggc ccg ggc gtc gac egg gcg gcc gtg gcg cag aca ctg gcc egg cgt 
Gly Pro Gly Val Asp Arg Ala Ala Val Ala Gin Thr Leu Ala Arg Arg 
640 645 650 655 

aeg cac tte aee eae egg gcc gta etg etc ggg gac ace gtc ate ggc 
Thr His Phe Thr His Arg Ala Val Leu Leu Gly Asp Thr Val lie Gly 
660 665 670 

get ecc ecc gcg gac cag gee gac gaa etc gtc ttc gtc tac tec ggt 
Ala Pro Pro Ala Asp Gin Ala Asp Glu Leu Val Phe Val Tyr Ser Gly 
675 680 685 

cag ggc ace cag eat ecc gcg atg ggc gag cag eta gcc gat teg teg 
Gin Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Asp Ser Ser 
690 695 700 

gtg gtg ttc gee gag egg atg , gee gag tgt gcg gcg gcg ttg ege gag 
Val Val Phe Ala Glu Arg Met Ala Glu Cys Ala Ala Ala Leu Arg Glu 
705 710 715 

tte gtg gac tgg gat ctg ttc aeg gtt etg gat gat ccg gcg gtg gtg 
Phe Val Asp Trp Asp Leu Phe Thr Val Leu Asp Asp Pro Ala Val Val 
720 725 730 735 

gac egg gtt gat gtg gtc cag ecc get tec tgg gcg atg atg gtt tec 
Asp Arg Val Asp Val Val Gin Pro Ala Ser Trp Ala Met Met Val Ser 
740 745 750 

etg gee gcg gtg tgg cag gcg gee ggt gtg egg ccg gat gcg gtg ate 
Leu Ala Ala Val Trp Gin Ala Ala Gly Val Arg Pro Asp Ala Val lie 
755 760 765 

ggc cat teg cag ggt gag ate gee gca get tgt gtg gcg ggt gcg gtg 
Gly His Ser Gin Gly Glu lie Ala Ala Ala Cys Val Ala Gly Ala Val 
770 775 780 

tea eta ege gat gee gee egg ate gtg aee ttg cgc age cag gcg ate 
Ser Leu Arg Asp Ala Ala Arg lie Val Thr Leu Arg Ser Gin Ala lie 
785 790 795 

gcc egg ggc ctg gcg ggc egg ggc gcg atg gca tec gtc gee ctg ecc 
Ala Arg Gly Leu Ala Gly Arg Gly Ala Met Ala Ser Val Ala Leu Pro 
800 805 810 815 

gcg cag gat gtc gag etg gtc gac ggg gee tgg ate gcc gcc cac aac 
Ala Gin Asp Val Glu Leu Val Asp Gly Ala Trp lie Ala Ala His Asn 
820 825 830 

ggg ecc gee tee aee gtg ate gcg ggc ace ccg gaa gcg gtc gac eat 
Gly Pro Ala Ser Thr Val lie Ala Gly Thr Pro Glu Ala Val Asp His 
i 835 840 845 



gtc etc acc get cat gag gca caa ggg gtg egg gtg egg egg ate aee 
Val Leu Thr Ala His Glu Ala Gin Gly Val Arg Val Arg Arg lie Thr 
850 855 860 
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gtc gac tat gcc teg cac acc ccg cac gtc gag ctg ate cge gac gaa 263 9 
Val Asp Tyr Ala Ser His Thr Pro His Val Glu Leu lie Arg Asp Glu 
865 870 875 

eta cte gac ate act age gac age age teg eag ace eeg etc gtg eeg 2687 
Leu Leu Asp lie Thr Ser Asp Ser Ser Ser Gin Thr Pro Leu Val Pro 
880 885 890 895 

tgg ctg teg ace gtg gac ggc ace tgg gtc gac age eeg ctg gac ggg 273 5 
Trp Leu Ser Thr Val Asp Gly Thr Trp Val Asp Ser Pro Leu Asp Gly 
900 905 910 

gag tac tgg tac egg aac ctg cgt gaa eeg gtc ggt ttc cac cec gcc 2783 
Glu Tyr Trp Tyr Arg Asn Leu Arg Glu Pro Val Gly Phe His Pro Ala 
915 920 925 

gtc age eag ttg eag gcc eag ggc gac ace gtg ttc gtc gag gtc age 2831 
Val Ser Gin Leu Gin Ala Gin Gly Asp Thr Val Phe Val Glu Val Ser 
930 935 940 

gcc age ccg gtg ttg ttg eag gcg atg gac gac gat gtc gtc aeg gtt 2879 
Ala Ser Pro Val Leu Leu Gin Ala Met Asp Asp Asp Val Val Thr Val 
945 950 955 

gee aeg ctg cgt cgt gac gac ggc gac gee acc egg atg etc acc gcc 2927 
Ala Thr Leu Arg Arg Asp Asp Gly Asp Ala Thr Arg Met Leu Thr Ala 
960 965 970 975 

ctg gca eag gcc tat gtc. cac ggc gtc ace gtc gac tgg cec gee ate 2975 
Leu Ala Gin Ala Tyr Val His Gly Val Thr Val Asp Trp Pro Ala lie 
980 985 990 



cte ggc acc acc aca acc egg gta ctg gac ctt ccg acc tac gcc ttc 3 023 
>-J Leu Gly Thr Thr Thr Thr Arg Val Leu Asp Leu Pro Thr Tyr Ala Phe 

Q 995 1000 1005 

caa cac eag egg tac tgg etc gag teg get cec ccg gcc aeg gee gac 3 071 
Gin His Gin Arg Tyr Trp Leu Glu Ser Ala Pro Pro Ala Thr Ala Asp 
1010 1015 1020 

teg ggc cac cec gtc etc ggc acc gga gtc gcc gtc gee ggg teg ccg 3119 
Ser Gly His Pro Val Leu Gly Thr Gly Val Ala Val Ala Gly Ser Pro 
1025 1030 1035 

ggc egg gtg ttc aeg ggt cec gtg cec gcc ggt gcg gac cge gcg gtg 3167 
Gly Arg Val Phe Thr Gly Pro Val Pro Ala Gly Ala Asp Arg Ala Val 
1040 1045 1050 1055 

ttc ate gcc gaa ctg gcg etc gee • gee gee gac gee acc gac tgc gee 3215 
Phe lie Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Thr Asp Cys Ala 
1060- 1065 1070 

aeg gtc gaa eag etc gac gtc ace tee gtg cec ggc gga tee gcc cge 3263 
Thr Val Glu Gin Leu Asp Val Thr Ser Val Pro Gly Gly Ser Ala Arg 
1075 1080 1085 



1^ 

m 



132 



ggc agg gcc acc gcg cag acc tgg gtc gat gaa ccc gcc gcc gac ggg 
Gly Arg Ala Thr Ala Gin Thr Trp Val Asp Glu Pro Ala Ala Asp Gly 
1090 1095 1100 

egg cgc cgc ttc acc gtc cac acc cgc gtc ggc gac gcc ccg tgg acg 
Arg Arg Arg Phe Thr Val His Thr Arg Val Gly Asp Ala Pro Trp Thr 
1105 1110 1115 

ctg cac gcc gag ggg gtt etc cgc ccc ggc cgc gtg ccc cag ccc gaa 
Leu His Ala Glu Gly Val Leu Arg Pro Gly Arg Val Pro Gin Pro Glu 
1120 1125 1130 1135 

gcc gtc gac acc gcc tgg ccc ccg ccg ggc gcg gtg ccc gcg gac ggg 
Ala Val Asp Thr Ala Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly 
1140 1145 1150 

ctg ccc ggg gcg tgg cga cgc gcg gac cag gtc ttc gtc gaa gcc gaa 
Leu Pro Gly Ala Trp Arg Arg Ala Asp Gin Val Phe Val Glu Ala Glu 
1155 1160 1165 

gtc gac age cet gac ggc ttc gtg gca cac ccc gac ctg etc gac gcg 
Val Asp Ser Pro Asp Gly Phe Val Ala His Pro Asp Leu Leu Asp Ala 
1170 1175 1180 

gtc ttc tec gcg gtc ggc gac ggg age cgc cag ccg acc gga tgg cgc 
Val Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Thr Gly Trp Arg 
1185 1190 1195 

gac etc gcg gtg cac gcg teg gac gcc acc gtg ctg cgc gee tgc etc 
Asp Leu Ala Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu 
1200 1205 1210 1215 

ace cgc cgc gac agt ggt gtc gtg gag etc gcc gee ttc gac ggt gee 
Thr Arg Arg Asp Ser Gly Val Val Glu Leu Ala Ala Phe Asp Gly Ala 
1220 1225 1230 

gga atg ccg gtg etc acc gcg gag teg gtg acg ctg ggc gag gtc gcg 
Gly Met Pro Val Leu Thr Ala Glu Ser Val Thr Leu Gly Glu Val Ala 
1235 1240 1245 

teg gca ggc gga tec gac gag teg gac ggt ctg ctt egg ett gag tgg 
Ser Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu Arg Leu Glu Trp 
1250 1255 1260 

ttg ccg gtg gcg gag gcc cac tac gac ggt gcc gac gag ctg ccc gag 
Leu Pro Val Ala Glu Ala His Tyr Asp Gly Ala Asp Glu Leu Pro Glu 
1265 1270 1275 

ggc tac acc etc ate acc gcc aca cac ccc gac gac ccc gac gac ccc 
Gly Tyr Thr Leu lie Thr Ala Thr His Pro Asp Asp Pro Asp Asp Pro 
1280 1285 1290 1295 



acc aac ccc cac aac aca ccc aca cgc acc cac aca caa acc aca cge 
Thr Asn Pro His Asn Thr Pro Thr Arg Thr His Thr Gin Thr Thr Arg 
1300 1305 1310 
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gtc etc acc gcc etc eaa eae eac etc ate acc ace aac cac ace etc 3983 

Val Leu Thr Ala Leu Gin His His Leu lie Thr Thr Asn His Thr Leu 
1315 1320 1325 

ate gtc eae aee ace ace gac ccc eca ggc gcc gee gtc acc ggc etc 4 031 
lie Val His Thr Thr Thr Asp Pro Pro Gly Ala Ala Val Thr Gly Leu 
1330 1335 1340 

acc cgc ace gea eaa aac gaa cac ccc ggc cgc ate eae etc ate gaa 4079 
Thr Arg Thr Ala Gin Asn Glu His Pro Gly Arg lie His Leu lie Glu 
1345 1350 1355 

acc cac cac ccc cac acc cea etc ccc etc acc eaa etc acc acc etc 4127 
Thr His His Pro His Thr Pro Leu Pro Leu Thr Gin Leu Thr Thr Leu 
1360 1365 1370 1375 

cac eaa ccc cac eta cgc etc ace aac aac acc etc eac acc ccc cac 4175 
His Gin Pro His Leu Arg Leu Thr Asn Asn Thr Leu His Thr Pro His 
1380 1385 1390 

etc acc ccc ate acc ace eac eac aac aee aee aca aee acc ccc aac 4223 
Leu Thr Pro lie Thr Thr His His Asn Thr Thr Thr Thr Thr Pro Asn 
1395 1400 1405 



Q ace cea ccc etc aac ccc aac cac gee ate etc ate ace ggc ggc tec 4271 
yj Thr Pro Pro Leu Asn Pro Asn His Ala lie Leu lie Thr Gly Gly Ser 
1410 1415 1420 



ggc acc etc gcc ggc ate etc gcc cgc cac etc aac cac ccc eac acc 4319 



Q. Gly Thr Leu Ala Gly lie Leu Ala Arg His Leu Asn His Pro His Thr 
^ 1425 1430 • 1435 

ill 

l-^ tac etc etc tee cgc aca cea cea ccc ccc acc aca ccc ggc acc cac 4367 

Tyr Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr Pro Gly Thr His 

Q 1440 1445 1450 1455 

ate ccc tgc gac etc acc gac ccc aee eaa ate aee eaa gcc etc aee 4415 

lie Pro Cys Asp Leu Thr Asp Pro Thr Gin lie Thr Gin Ala Leu Thr 

1460 1465 1470 

cac ata eca eaa ccc etc acc ggc ate ttc cac acc gee gcc acc etc 4463 

His lie Pro Gin Pro Leu Thr Gly lie Phe His Thr Ala Ala Thr Leu 
1475 1480 1485 

gac gac gcc acc etc ace aac etc ace ccc eaa eac etc acc ace ace 4511 

Asp Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin His Leu Thr Thr Thr 
1490 1495 1500 

etc eaa ccc aaa gee gac gee gee tgg eac etc cac cac cac acc eaa 4559 

Leu Gin Pro Lys Ala Asp Ala Ala Trp His Leu His His His Thr Gin 
1505 1510 . 1515 

aac eaa ccc etc acc eac ttc gtc etc tac tec age gee gcc gcc ace 4607 

Asn Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala Ala Thr 

1520 1525 1530 1535 
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etc ggc age ccc ggc caa gcc aac tac gcc gcc gcc aac gcc ttc etc 4655 
Leu Gly Ser Pro Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Phe Leu 
1540 1545 1550 

gac gcc etc gcc acc eac cge cac acc eaa gga caa ccc gcc acc acc 4703 
Asp Ala Leu Ala Thr His Arg His Thr Gin Gly Gin Pro Ala Thr Thr 
1555 1560 1565 

ate gcc tgg ggc atg tgg cac ace acc acc aca etc acc age caa etc 4751 
lie Ala Trp Gly Met Trp His Thr Thr Thr Thr Leu Thr Ser Gin Leu 
1570 1575 1580 

ace gac age gac cge gac cge ate cge cge ggc ggc ttc etg ccg ate 4 7 99 
Thr Asp Ser Asp Arg Asp Arg lie Arg Arg Gly Gly Phe Leu Pro lie 
1585 1590 1595 

teg gac gac gag ggc atg e 4 818 

Ser Asp Asp Glu Gly Met 
1600 1605 



<210> 33 
<211> 1605 



£ <212> PRT 

Q <213> Artificial Sequence - 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

Q <400> 33 

SI Met Arg Leu Tyr Glu Ala Ala Arg Arg Thr Gly Ser Pro Val Val Val 

fU 1 5 10 15 

\l 

Q Ala Ala Ala Leu Asp Asp Ala Pro Asp Val Pro Leu Leu Arg Gly Leu 

U 20 25 30 

Arg Arg Thr Thr Val Arg Arg Ala Ala Val Arg Glu Arg Ser Leu Ala 
35 ■ 40 45 

Asp Arg Ser Pro Cys Cys Pro Thr Thr Ser Ala Pro Thr Pro Pro Ser 
50 55 60 

Arg Ser Ser Trp Asn Ser Thr Ala Thr Val Leu Gly His Leu Gly Ala 
65 70 75 80 

Glu Asp lie Pro Ala Thr Thr Thr Phe Lys Glu Leu Gly lie Asp Ser 
85 90 95 

Leu Thr Ala Val Gin Leu Arg Asn Ala Leu Thr Thr Ala Thr Gly Val 
100 105 110 

Arg Leu Asn Ala Thr Ala Val Phe Asp Phe Pro Thr Pro Arg Ala Leu 
115 120 125 

Ala Ala Arg Leu Gly Asp Glu Leu Ala Gly Thr Arg Ala Pro Val Ala 
130 135 140 
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Ala Arg Thr Ala Ala Thr Ala Ala Ala His Asp Glu Pro Leu Ala lie 
145 150 155 160 

Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Ala Ser Pro Gin Glu 
165 170 175 

Leu Trp Arg Leu Val Ala Ser Gly Thr Asp Ala lie Thr Glu Phe Pro 
180 185 190 

Ala Asp Arg Gly Trp Asp Val Asp Ala Leu Tyr Asp Pro Asp Pro Asp 
195 200 205 

Ala lie Gly Lys Thr Phe Val Arg His Gly Gly Phe Leu Asp Gly Ala 
210 215 220 

Thr Gly Phe Asp Ala Ala Phe Phe Gly lie Ser Pro Arg Glu Ala Leu 
225 230 235 240 

Ala Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser Trp Glu Ala 
245 250 255 



"'T Phe Glu Ser Ala Gly lie Thr Pro Asp Ala Ala Arg Gly Ser Asp Thr 
260 265 270 

=P 

Q Gly Val Phe lie Gly Ala Phe Ser Tyr Gly Tyr Gly Thr Gly Ala Asp 
W 275 280 285 

M» 

Thr Asn Gly Phe Gly Ala Thr Gly Ser Gin Thr Ser Val Leu Ser Gly 

^. 290 .295 300 

Q 

^ Arg Leu Ser Tyr Phe Tyr Gly Leu Glu Gly Pro Ser Val Thr Val Asp 
^ 305 310 315 320 

Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gin Ala Gly Gin Ser 
Q 325 330 335 

Leu Arg Ser Gly Glu Cys Ser Leu Ala Leu Val Gly Gly Val Thr Val 
340 345 350 

Met Ala Ser Pro Gly Gly Phe Val Glu Phe Ser Arg Gin Arg Gly Leu 
355 360 365 

Ala Pro Asp Gly Arg Ala Lys Ala Phe Gly Ala Gly Ala Asp Gly Thr 
370 375 380 

Ser Phe Ala Glu Gly Ala Gly Ala Leu Val Val Glu Arg Leu Ser Asp 
385 390 395 400 

Ala Glu Arg His Gly His Thr Val Leu Ala Leu Val Arg Gly Ser Ala 
405 410 415 

Ala Asn Ser Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly Pro 
420 425 430 



Ser Gin Glu Arg Val lie His Gin Ala Leu Ala A&n Ala Lys Leu Thr 
435 440 445 
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Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu 
450 455 460 

Gly Asp Pro lie Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Asp 
465 470 475 480 

Arg Ala Thr Pro Leu Leu Leu Gly Ser Leu Lys Ser Asn lie Gly His 
485 490 495 

Ala Gin Ala Ala Ser Gly Val Ala Gly lie lie Lys Met Val Gin Ala 
500 505 510 

lie Arg His Gly Glu Leu Pro Pro Thr Leu His Ala Asp Glu Pro Ser 
515 520 525 

Pro His Val Asp Trp Thr Ala Gly Ala Val Glu Leu Leu Thr Ser Ala 
530 535 540 

Arg Pro Trp Pro Gly Thr Gly Arg Pro Arg Arg Ala Ala Val Ser Ser 
545 550 555 560 

Phe Gly Val Ser Gly Thr Asn Ala His lie lie Leu Glu Ala Gly Pro 
565 570 575 

Val Lys Thr Gly Pro Val Glu Ala Gly Ala lie Glu Ala Gly Pro Val 
580 585 590 

Glu Val Gly Pro Val Glu Ala Gly Pro Leu Pro Ala Ala Pro Pro Ser 
595 600 605 

Ala Pro Gly Glu Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro Glu 
610 615 620 

. Ala Leu Asp Glu Gin lie Gly Arg Leu Arg Ala Tyr Leu Asp Thr Gly 
625 630 635 640 

Pro Gly Val Asp Arg Ala Ala Val Ala Gin Thr Leu Ala Arg Arg Thr 
645 650 655 

His Phe Thr His Arg Ala Val Leu Leu Gly Asp Thr Val lie Gly Ala 
660 665 670 

Pro Pro Ala Asp Gin Ala Asp Glu Leu Val Phe Val Tyr Ser Gly Gin 
675 680 685 

Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Asp Ser Ser Val 
690 695 700 

Val Phe Ala Glu Arg Met Ala Glu Cys Ala Ala Ala Leu Arg Glu Phe 
705 710 715 720 

Val Asp Trp Asp Leu Phe Thr Val Leu Asp Asp Pro Ala Val Val Asp 
725 730 735 

Arg Val Asp Val Val Gin Pro Ala Ser Trp Ala Met Met Val Ser Leu 
740 745 750 
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Ala Ala Val Trp Gin Ala Ala Gly Val Arg Pro Asp Ala Val lie Gly 
755 760 765 

His Ser Gin Gly Glu lie Ala Ala Ala Cys Val Ala Gly Ala Val Ser 
770 775 780 

Leu Arg Asp Ala Ala Arg lie Val Thr Leu Arg Ser Gin Ala lie Ala 
785 790 795 800 

Arg Gly Leu Ala Gly Arg Gly Ala Met Ala Ser Val Ala Leu Pro Ala 
805 810 815 

Gin Asp Val Glu Leu Val Asp Gly Ala Trp lie Ala Ala His Asn Gly 
820 825 830 

Pro Ala Ser Thr Val lie Ala Gly Thr Pro Glu Ala Val Asp His Val 
835 840 845 

Leu Thr Ala His Glu Ala Gin Gly Val Arg Val Arg Arg lie Thr Val 
^ 850 855 860 

Asp Tyr Ala Ser His Thr Pro His Val Glu Leu lie Arg Asp Glu Leu 
865 870 875 880 

Q Leu Asp lie Thr Ser Asp Ser Ser Ser Gin Thr Pro Leu Val Pro Trp 
yj 885 890 895 

g) Leu Ser Thr Val Asp Gly Thr Trp Val Asp Ser Pro Leu Asp Gly Glu 
^ 900 905 910 

^ Tyr Trp Tyr Arg Asn Leu Arg Glu Pro Val Gly Phe His Pro Ala Val 



915 920 925 

§y 

Ser Gin Leu Gin Ala Gin Gly Asp Thr Val Phe Val Glu Val Ser Ala 
O 930 935 940 

Ser Pro Val Leu Leu Gin Ala Met Asp Asp Asp Val Val Thr Val Ala 
945 950 955 960 

Thr Leu Arg Arg Asp Asp Gly Asp Ala Thr Arg Met Leu Thr Ala Leu 
965 970 975 

Ala Gin Ala Tyr Val His Gly Val Thr Val Asp Trp Pro Ala lie Leu 
980 985 990 

Gly Thr Thr Thr Thr Arg Val Leu Asp Leu Pro Thr Tyr Ala Phe Gin 
995 1000 1005 

His Gin Arg Tyr Trp Leu Glu Ser Ala Pro Pro Ala Thr Ala Asp Ser 
1010 1015 1020 

Gly His Pro Val Leu Gly Thr Gly Val Ala Val Ala Gly Ser Pro Gly 
025 1030 1035 104 

Arg Val Phe Thr Gly Pro Val Pro Ala Gly Ala Asp Arg Ala Val Phe 
1045 1050 1055 
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lie Ala Glu Leu Ala Leu Ala Ala Ala Asp Ala Thr Asp Cys Ala Thr 
1060 1065 1070 

Val Glu Gin Leu Asp Val Thr Ser Val Pro Gly Gly Ser Ala Arg Gly 
1075 1080 1085 

Arg Ala Thr Ala Gin Thr Trp Val Asp- Glu Pro Ala Ala Asp Gly Arg 
1090 1095 1100 

Arg Arg Phe Thr Val His Thr Arg Val Gly Asp Ala Pro Trp Thr Leu 
105 1110 1115 112 

His Ala Glu Gly Val Leu Arg Pro Gly Arg Val Pro Gin Pro Glu Ala 
1125 1130 1135 

Val Asp Thr Ala Trp Pro Pro Pro Gly Ala Val Pro Ala Asp Gly Leu 
1140 1145 1150 

Pro Gly Ala Trp Arg Arg Ala Asp Gin Val Phe Val Glu Ala Glu Val 
1155 1160 1165 

Asp Ser Pro Asp Gly Phe Val Ala His Pro Asp Leu Leu Asp Ala Val 
1170 1175 1180 

Phe Ser Ala Val Gly Asp Gly Ser Arg Gin Pro Thr Gly Trp Arg Asp 
185 1190 1195 120 

Leu Ala Val His Ala Ser Asp Ala Thr Val Leu Arg Ala Cys Leu Thr 
1205 1210 1215 

Arg Arg Asp Ser Gly Val Val Glu Leu Ala Ala Phe Asp Gly Ala Gly 
1220 1225 1230 

Met Pro Val Leu Thr Ala Glu Ser Val Thr Leu Gly Glu Val Ala Ser 
1235 1240 1245 

Ala Gly Gly Ser Asp Glu Ser Asp Gly Leu Leu Arg Leu Glu Trp Leu 
1250 1255 1260 

Pro Val Ala Glu Ala His Tyr Asp Gly Ala Asp Glu Leu Pro Glu Gly 
265 1270 1275 128 

Tyr Thr Leu He Thr Ala Thr His Pro Asp Asp Pro Asp Asp Pro Thr 
1285 1290 1295 

Asn Pro His Asn Thr Pro Thr Arg Thr His Thr Gin Thr Thr Arg Val 
1300 1305 1310 

Leu Thr Ala Leu Gin His His Leu He Thr Thr Asn His Thr Leu He 
1315 1320 1325 

Val His Thr Thr Thr Asp Pro Pro Gly Ala Ala Val Thr Gly Leu Thr 
1330 1335 1340 

Arg Thr Ala Gin Asn Glu His Pro Gly Arg He His Leu He Glu Thr 
345 1350 1355 136 



139 



His His Pro His Thr Pro Leu Pro Leu Thr Gin Leu Thr Thr Leu His 
1365 1370 1375 

Gin Pro His Leu Arg Leu Thr Asn Asn Thr Leu His Thr Pro His Leu 
1380 1385 1390 

Thr Pro lie Thr Thr His His Asn Thr Thr Thr Thr Thr Pro Asn Thr 
1395 ' 1400 1405 

Pro Pro Leu Asn Pro Asn His Ala lie Leu lie Thr Gly Gly Ser Gly 
1410 1415 1420 

Thr Leu Ala Gly lie Leu Ala Arg His Leu Asn His Pro His Thr Tyr 
425 1430 1435 144 

Leu Leu Ser Arg Thr Pro Pro Pro Pro Thr Thr Pro Gly Thr His lie 
1445 1450 1455 

Pro Cys Asp Leu Thr Asp Pro Thr Gin lie Thr Gin Ala Leu Thr His 
1460 1465 1470 

lie Pro Gin Pro Leu Thr Gly lie Phe His Thr Ala Ala Thr Leu Asp 
1475 1480 1485 

Asp Ala Thr Leu Thr Asn Leu Thr Pro Gin His Leu Thr Thr Thr Leu 
1490 1495 . 1500 

Gin Pro Lys Ala Asp Ala Ala Trp His Leu His His His Thr Gin Asn 
505 . 1510 1515 152 

Gin Pro Leu Thr His Phe Val Leu Tyr Ser Ser Ala Ala Ala Thr Leu 
1525 1530 1535 

Gly Ser Pro Gly Gin Ala Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp 
1540 1545 1550 

Ala Leu Ala Thr His Arg His Thr Gin Gly Gin Pro Ala Thr Thr lie 
1555 1560 1565 

Ala Trp Gly Met Trp His Thr Thr Thr Thr Leu Thr Ser Gin Leu Thr 
1570 1575 1580 

Asp Ser Asp Arg Asp Arg lie Arg Arg Gly Gly Phe Leu Pro lie Ser 
585 1590 1595 160 

Asp Asp Glu Gly Met 
1605 



<210> 34 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 
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<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 34 

ggc cgt ccg cgc cgt gcg gcg gtc teg teg ttc 33 
Gly Arg Pro Arg Arg Ala Ala Val Ser Ser Phe 
15 10 



<210> 35 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

^ <400> 35 

^ Gly Arg Pro Arg Arg Ala Ala Val Ser Ser Phe 
1 5 10 

Q <210> 36 
yj <211> 33 
<212> DNA 

<213> Artificial Sequence 



pa, 
fll 



<220> 

^ <223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

N <22 0> 

Q <221> CDS 

<222> (1) . . (33) 

<400> 36 

acc cag cat ccc gcg atg ggt gag egg etc gcc 3 3 

Thr Gin His Pro Ala Met Gly Glu Arg Leu Ala 
15 10 



<210> 37 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 37 

Thr Gin His Pro Ala Met Gly Glu Arg Leu Ala 
15 10 
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<210> 38 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 38 

tac gcc ttc cag egg egg ccc tac tgg ate gag 
Tyr Ala Phe Gin Arg Arg Pro Tyr Trp lie Glu 
15 10 



<210> 39 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 39 

Tyr Ala Phe Gin Arg Arg Pro Tyr Trp lie Glu 
1 5 10 



<210> 40 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> * 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 40 

gac egg ccc cgt egg gcg ggc gtg teg tec ttc 
Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe 
1 5 10 



<210> 41 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 41 

Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe 
15 10 



<210> 42 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment • 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 42 

tgg cag tgg ctg ggg atg ggc agt gcc ctg egg 
Trp Gin Trp Leu Gly Met Gly Ser Ala Leu Arg 
15 10 



<210> 43 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 43 

Trp Gin Trp Leu Gly Met Gly Ser Ala Leu Arg 
1 * 5 10 



<210> 44 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 44 

tac gcc ttc caa cac cag egg tac tgg gtc gag 
Tyr Ala Phe Gin His Gin Arg Tyr Trp Val Glu 
15 10 
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<210> 45 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 45 

Tyr Ala Phe Gin His Gin Arg Tyr Trp Val Glu 
1 5 10 



<210> 46 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 46 

ggc cga gcg cgc egg gca ggc gtg teg tec ttc 
Gly Arg Ala Arg Arg Ala Gly Val Ser Ser Phe 
15 10 



<210> 47 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 47 

Gly Arg Ala Arg Arg Ala Gly Val Ser Ser Phe 
15 10 



<210> 48 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 
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<220> 
<221> CDS 
<222> (1) , . (33) 

<400> 48 

teg cag cgt get ggc atg ggt gag gaa etg gee 

Ser Gin Arg Ala Gly Met Gly Glu Glu Leu Ala 

15 10 



<210> 49 
<211> 11 
<212> PRT 

<213> Artifieial Sequence 
<220> 

<223> Description of Artifieial Sequence: Synthetic PKS 
synthase fragment 

<400> 49 

Ser Gin Arg Ala Gly Met Gly Glu Glu Leu Ala 

^ . 1 5 10 

•JS <210> 50 
n <211> 33 
2 <212> DNA 

<213> Artificial Sequence 

<220> 

^ <223> Description of Artificial Sequence: DNA encoding synthetic 
g PKS synthase fragment 

fU <220> 

\l <221> CDS 

g <222> (1) . . (33) 

<400> 50 

tac gcc ttc cag cac cag cgc tae tgg etc gag 33 

Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu Glu 
15 10 



<210> 51 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 51 

Tyr Ala Phe Gin His Gin Arg Tyr Trp Leu Glu 
1 5 10 
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<210> 52 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 52 

gcg cga ccg cgc egg gcg ggg gtc teg teg ttc 3 3 

Ala Arg Pro Arg Arg Ala Gly Val Ser Ser Phe 
15 10 



^ <210> 53 
U <211> 11 

<212> PRT 
=B <213> Artificial Secjuence 

=P. 

Q <220> 

y <223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

ffl 

■ <400> 53 

Ala Arg Pro Arg Arg Ala Gly Val Ser Ser Phe 
S 1 5 10 



<210> 54 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 54 

tgg cag tgg gcg ggc atg gcc gtc gac ctg etc 33 
Trp Gin Trp Ala Gly Met Ala Val Asp Leu Leu 
15 10 



<210> 55 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 55 

Trp Gin Trp Ala Gly Met Ala Val Asp Leu Leu 
1 .5 10 

<210> 56 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 56 

tac ccg ttc cag cgc gag cgc gtc tgg etc gaa 
Tyr Pro Phe Gin Arg Glu Arg Val Trp Leu Glu 
15 10 

<210> 57 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 57 

Tyr Pro Phe Gin Arg Glu Arg Val Trp Leu Glu 
15 10 

<210> 58 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 
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<400> 58 

gac ggg gtg cgc egg gca ggt gtg teg gcg ttc 33 
Asp Gly Val Arg Arg Ala Gly Val Ser Ala Phe 
15 10 



<210> 59 
, <211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 59 

^al Arg Arg Ala Gly Val Ser Ala Phe 

10 





1 


5 




<210> 


60 


2 


<211> 


33 




<212> 


DNA 




<213> 


Artificial Sequence 


0 






W 


<220> 


Description of Artifi< 




<223> 


m- 




PKS synthase fragment 


s 


<220> 






<221> 


CDS 


m 


<222> 


(1) . . (33) 


m 








<400> 


60 


a 


gcc cag tgg gaa ggc atg gcg 




Ala Gin Trp Glu Gly Met Ala 




1 


5 




<210> 


61 




<211> 


11 




<212> 


PRT 




<213> 


Artificial Sequence 



DNA encoding synthetic 



:tg ttg 33 
jeu Leu 
10 



<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 61 

Ala Gin Trp Glu Gly Met Ala Arg Glu Leu Leu 
1 5 10 



<210> 62 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



# 
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<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (33) 

<400> 62 

tat cct ttc cag ggc aag egg ttc tgg ctg ctg 33 

Tyr Pro Phe Gin Gly Lys Arg Phe Trp Leu Leu 
1 5 10 



1^ 



m 



<210> 63 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 63 

Tyr Pro Phe Gin Gly Lys Arg Phe Trp Leu Leu 
15 10 



<210> 64 
<211> 480 
y <212> DNA 

<213> Artificial Sequence 



m 

%i <220> 

Q <223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 

<221> CDS 

<222> (3) . . (479) 

<400> 64 

cc ggc gcc gtc gaa ctg ctg acg teg gcc egg ccg tgg ccc gag acc 47 

Gly Ala Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr 
1 5 10 15 

gac egg eca egg cgt gcc gcc gtc tec teg ttc ggg gtg age ggc acc 95 
Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Val Ser Gly Thr 
20 25 30 

aac gcc cae gtc ate ctg gag gcc gga ccg gta acg gag acg ccc geg 143 
Asn Ala His Val lie Leu Glu Ala Gly Pro Val Thr Glu Thr Pro Ala 
35 40 45 



gea teg cct tec ggt gac ett ccc ctg ctg gtg teg gea cgc tea ccg 
Ala Ser Pro Ser Gly Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro 
50 55 60 



191 



# 
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gaa gcg etc gac gag cag ate cgc cga etg cge gee tac ctg gae aee 23 9 
Glu Ala Leu Asp Glu Gin lie Arg Arg Leu Arg Ala Tyr Leu Asp Thr 
65 70 75 

aec ccg gac gtc gac egg gtg gee gtg gca cag acg ctg gcc egg cge 2 87 
Thr Pro Asp Val Asp Arg Val Ala Val Ala Gin Thr Leu Ala Arg Arg 
80 85 90 95 

aca cac ttc gee eac cge gee gtg ctg etc ggt gac ace gtc ate ace 335 
Thr His Phe Ala His Arg Ala Val Leu Leu Gly Asp Thr Val lie Thr 
100 105 110 

aca cec ecc gcg gae egg ccc gae gaa etc gtc ttc gtc tac tec ggc 3 83 
Thr Pro Pro Ala Asp Arg Pro Asp Glu Leu Val Phe Val Tyr Ser Gly 
115 120 125 

cag gge ace cag cat cec gcg atg ggc gag cag etc gee gee gee eat 431 
Gin Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Ala Ala His 
130 135 140 

cec gtg ttc gee gac gee tgg eat gaa gcg etc cgc cgc ctt gac aac e 480 
Pro Val Phe Ala Asp Ala Trp His Glu Ala Leu Arg Arg Leu Asp Asn 
145 150 155 



yj <210> 65 

<211> 159 
<212>. PRT 

<213> Artificial Sequence 



D 

m 



<220> 

^ <223> Description of Artificial Sequence: Synthetic PKS 

synthase fragment 

O <400> 65 

Gly Ala Val Glu Leu Leu Thr Ser Ala Arg Pro Trp Pro Glu Thr Asp 
15 10 15 

Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Val Ser Gly Thr Asn 
20 * 25 30 

Ala His Val lie Leu Glu Ala Gly Pro Val Thr Glu Thr Pro Ala Ala 
35 40 45 

Ser Pro Ser Gly Asp Leu Pro Leu Leu Val Ser Ala Arg Ser Pro Glu 
50 55 60 

Ala Leu Asp Glu Gin lie Arg Arg Leu Arg Ala Tyr Leu Asp Thr Thr 
65 70 75 80 

Pro Asp Val Asp Arg Val Ala Val Ala Gin Thr Leu Ala Arg Arg Thr 
85 90 95 

His Phe Ala His Arg Ala Val Leu Leu Gly Asp Thr Val lie Thr Thr 
100 105 110 
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Pro Pro Ala Asp Arg Pro Asp Glu Leu Val Phe Val Tyr Ser Gly Gin 
115 120 125 

Gly Thr Gin His Pro Ala Met Gly Glu Gin Leu Ala Ala Ala His Pro 
130 135 140 

Val Phe Ala Asp Ala Trp His Glu Ala Leu Arg Arg Leu Asp Asn 
145 150 155 



<210> 66 
<211> 120 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence; DNA encoding synthetic 
PKS synthase fragment 



m 

E 

O 

m 

fU 



<220> 

<221> CDS 

<222> (3) . . (119) 



^ <400> 66 

O tc etc ggg get ggg tea egg cac gac gcg gat gtg cce gcg tac gcg 4 7 

y Leu Gly Ala Gly Ser Arg His Asp Ala Asp Val Pro Ala Tyr Ala 

1 . 5 10 15 



ttc caa egg egg cac tac tgg ate gag teg gca cgc ccg. gee gca tee .95 
Phe Gin Arg Arg His Tyr Trp lie Glu Ser Ala Arg Pro Ala Ala Ser 
20 25 30 



gac gcg ggc cac cce gtg ctg ggc t 120 
Asp Ala Gly His Pro Val Leu Gly 
Q 35 



<210> 67 

<211> 39 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 67 

Leu Gly Ala Gly Ser Arg His Asp Ala Asp Val Pro Ala Tyr Ala Phe 
1 5 10 15 

Gin Arg Arg His Tyr Trp lie Glu Ser Ala Arg Pro Ala Ala Ser Asp 
20 . 25 30 

Ala Gly His Pro Val Leu Gly 
35 
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a 



<210> 68 
<211> 480 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 

<221> CDS 

<222> (1) . . (480) 

<400> 68 

teg gcc agg ccg tgg ccg egg acc ggc cgt ccg cgc cgt gcg gcg gtc 4 8 
Ser Ala Arg Pro Trp Pro Arg Thr Gly Arg Pro Arg Arg Ala Ala Val 
15 10 15 

teg teg ttc ggg gtg age ggc acc aac gee cae ate ate ctg gag gee 96 
Ser Ser Phe Gly Val Ser Gly Thr Asn Ala His He He Leu Glu Ala 
20 25 30 



gga cce gae cag gag gag ccg teg gca gaa ccg gee ggt gae etc ccg 144 

Gly Pro Asp Gin Glu Glu Pro Ser Ala Glu Pro Ala Gly Asp Leu Pro 

4^ 35 40 45 

P 

yj ctg etc gtg teg gca egg tec ccg gag gca ctg gae gag cag ate ggg 192 

Leu Leu Val Ser Ala Arg Ser Pro Glu Ala Leu Asp Glu Gin He Gly 

m ' 50 - 55 60 



cgc ctg cgc gae tat etc gae gcc gee cce ggc gtg gae ctg gcg gcc 24 0 
Arg Leu Arg Asp Tyr Leu Asp Ala Ala Pro Gly Val Asp Leu Ala Ala 
65 70 75 80 



]M gtg gcg egg aca ctg gcc acg cgt acg cae ttc tec cae cgc gee gta 
p Val Ala Arg Thr Leu Ala Thr Arg Thr His Phe Ser His Arg Ala Val 
H 90 95 



288 



ctg etc ggt gae acc gtc ate acc get cce cce gtg gaa cag ccg ggc 336 
Leu Leu Gly Asp Thr Val He Thr Ala Pro Pro Val Glu Gin Pro Gly 
100 105 110 

gag etc gtc ttc gtc tac teg gga cag ggc acc cag eat cce gcg atg 384 
Glu Leu Val Phe Val Tyr Ser Gly Gin Gly Thr Gin His Pro Ala Met 
115 120 125 

.ggt gag egg etc gcc gca gcc ttc cce gtg ttc gee gae ccg gae gta 43 2 
Gly Glu Arg Leu Ala Ala Ala Phe Pro Val Phe Ala Asp Pro Asp Val 
130 135 140 

ecc gee tac gcc ttc eag egg egg cce tac tgg ate gag tec gcg ccg 480 
Pro Ala Tyr Ala Phe Gin Arg Arg Pro Tyr Trp He Glu Ser Ala Pro 
145 150 155 160 



<210> 69 
<211> 160 
<212> PRT 



4 » 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 69 

Ser Ala Arg Pro Trp Pro Arg Thr Gly Arg Pro Arg Arg Ala Ala Val 
15 10 15 

Ser Ser Phe Gly Val Ser Gly Thr Asn Ala His He He Leu Glu Ala 
20 25 30 

Gly Pro Asp Gin Glu Glu Pro Ser Ala Glu Pro Ala Gly Asp Leu Pro 
35 40 45 

Leu Leu Val Ser Ala Arg Ser Pro Glu Ala Leu Asp Glu Gin He Gly 
50 55 60 

Arg Leu Arg Asp Tyr Leu Asp Ala Ala Pro Gly Val Asp Leu Ala Ala 
65 70 75 80 

Val Ala Arg Thr Leu Ala Thr Arg Thr His Phe Ser His Arg Ala Val 
85 90 95 



l^i Leu Leu Gly Asp Thr Val He Thr Ala Pro Pro Val Glu Gin Pro Gly 
id 100 105 110 



Glu Leu Val Phe Val Tyr Ser Gly Gin Gly Thr Gin His Pro Ala Met 
115 120 125 



^ Gly Glu Arg Leu Ala Ala Ala Phe Pro Val Phe Ala Asp Pro Asp Val 
^ 130 135 140 

fU, 

'Si Pro Ala Tyr Ala Phe Gin Arg Arg Pro Tyr Trp He Glu Ser Ala Pro 
a 145 150 155 160 

<210> 70 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA encoding synthetic 
PKS synthase fragment 

<220> 
<221> CDS 
<222> (1) . . (60) 

<400> 70 

gac ccg gac gta ccc gcc tac gcc ttc cag egg egg ccc tac tgg ate 48 

Asp Pro Asp Val Pro Ala Tyr Ala Phe Gin Arg Arg Pro Tyr Trp He 
15 10 15 
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gag tec gcg ccg 
Glu Ser Ala Pro 
20 



<210> 71 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic PKS 
synthase fragment 

<400> 71 

Asp Pro Asp Val Pro Ala Tyr Ala Phe Gin Arg Arg Pro Tyr 
15 10 

Glu Ser Ala Pro 
20 



